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Preface 



Does the world really need a new textbook on general relativity? I feel that my first 
duty in presenting this book should be to provide a convincing affirmative answer 
to this question. 

There is already a vast array of available books. I will not attempt here to 
make an exhaustive list, but I will mention three of my favorites. For its unsur- 
passed pedagogical presentation of the elementary aspects of general relativity, I 
like Schutz' A first course in general relativity. For its unsurpassed completeness, I 
like Gravitation, by Misner, Thorne, and Wheeler. And for its unsurpassed elegance 
and rigour, I like Wald's General relativity. In my view, a serious student could do 
no better than start with Schutz for an outstanding introductory course, then move 
on to Misner, Thorne, and Wheeler to get a broad coverage of many different topics 
and techniques, and then finish off with Wald to gain access to the more modern 
topics and the mathematical standard that Wald has since imposed on this field. 
This is a long route, but with this book I hope to help the student along: I see my 
place as being somewhere between Schutz and Wald — more advanced than Schutz 
but less sophisticated than Wald — and I cover some of the few topics that are not 
handled by Misner, Thorne, and Wheeler. 

In the winter of 1998 I was given the responsibility of creating an advanced 
course in general relativity. The course was intended for graduate students working 
in the Gravitation Group of the Guelph- Waterloo Physics Institute, a joint graduate 
program in Physics shared by the Universities of Guelph and Waterloo. I thought 
long and hard before giving the first offering of this course, in an effort to round 
up the most useful and interesting topics, and to create the best possible course. 
I came up with a few guiding principles. First, I wanted to let the students in on 
a number of results and techniques that are part of every relativist's arsenal, but 
are not adequately covered in the popular texts. Second, I wanted the course to be 
practical, in the sense that the students would learn how to compute things, not 
just a bunch of abstract concepts. And third, I wanted to put these techniques to 
work in a really cool application of the theory, so that this whole enterprise would 
seem to have purpose. 

As I developed the course it became clear that it would not match the material 
covered in any of the existing textbooks; to meet my requirements I would have to 
form a synthesis of many texts, I would have to consult review articles, and I would 
have to go to the technical literature. This was a long but enjoyable undertaking, 
and I learned a lot. It gave me the opportunity to homogenize the various separate 
treatments, consolidate the various different notations, and present this synthesis as 
a unified whole. During this process I started to type up lecture notes that would 
be distributed to the students. These have evolved into this book. 

In the end, the course was designed around my choice of "really cool application" . 
There was no contest: the immediate winner was the mathematical theory of black 
holes, surely one of the most elegant, successful, and relevant applications of general 
relativity. This is covered in Chapter 5 of this book, which offers a thorough review 
of the solutions to the Einstein field equations that describe isolated black holes, 
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a description of the fundamental properties of black holes that are independent of 
the details of any particular solution, and an introduction to the four laws of black- 
hole mechanics. In the next paragraphs I outline the material covered in the other 
chapters, and describe the connections with the theory of black holes. 

The most important aspect of black-hole spacetimes is that they contain an 
event horizon, a null hypersurface that marks the boundary of the black hole and 
shields external observers from events going on inside. On this hypersurface there 
runs a network (or congruence) of nonintersecting null geodesies; these are called 
the null generators of the event horizon. To understand the behaviour of the horizon 
as a whole it proves necessary to understand how the generators themselves behave, 
and in Chapter 2 of this book we develop the relevant techniques. The description 
of congruences is concerned with the motion of nearby geodesies relative to a given 
reference geodesic; this motion is described by a deviation vector that lives in a space 
orthogonal to the reference geodesic's tangent vector. This transverse space is easy 
to construct when the geodesies are timelike, but the case of null geodesies is subtle. 
This has to do with the fact that the transverse space is then two-dimensional — the 
null vector tangent to the generators is orthogonal to itself and this direction must 
be explicitly removed from the transverse space. I show how this is done in Chapter 
2. While null congruences are treated in other textbooks (most notably in Wald), 
the student is likely to find my presentation [which I have adapted from Carter 
(1979)] better suited for practical computations. While Chapter 2 is concerned 
mostly with congruences of null geodesies, I present also a complete treatment of 
the timelike case. There are two reasons for this. First, this forms a necessary 
basis to understand the subtleties associated with the null case. Second, and more 
importantly, the mathematical techniques involved in the study of congruences of 
timelike geodesies are used widely in the general relativity literature, most notably 
in the field of mathematical cosmology. Another topic covered in Chapter 2 are 
the standard energy conditions of general relativity; these constraints on the stress- 
energy tensor ensure that under normal circumstances, gravity acts as an attractive 
force — it tends to focus geodesies. Energy conditions appear in most theorems 
governing the behaviour of black holes. 

Many quantities of interest in black-hole physics are defined by integration over 
the event horizon. An obvious example is the hole's surface area. Another example 
is the gain in mass of an accreting black hole; this is obtained by integrating a 
certain component of the accreting material's stress-energy tensor over the event 
horizon. These integrations require techniques that are introduced in Chapter 3 of 
this book. In particular, we shall need a notion of surface element on the event 
horizon. If the horizon were a timelike or a spacelike hypersurface, the construction 
of a surface element would pose no particular challenge, but once again there are 
interesting subtleties associated with the null case. I provide a complete treatment 
of these issues in Chapter 3; I believe that my presentation is more systematic, 
and more practical, than what can be found in the popular textbooks. Other 
topics covered in Chapter 3 include the initial-value problem of general relativity 
(which involves the induced metric and extrinsic curvature of a spacelike hypersur- 
face), and the Darmois-Lanczos-Israel-Barrabes formalism for junction conditions 
and thin shells (which constrains the possible discontinuities in the induced metric 
and extrinsic curvature). The initial- value problem is discussed at a much deeper 
level in Wald, but I felt it was important to include this material here: it provides 
a useful illustration of the physical meaning of the extrinsic curvature, an object 
that plays an important role in Chapter 4 of this book. Junction conditions and 
thin shells, on the other hand, are not covered adequately in any textbook, in spite 
of the fact that the Darmois-Lanczos-Israel-Barrabes formalism is used very widely 
in the literature. (Junction conditions and thin shells are touched upon in Misner, 
Thorne, and Wheeler, but I find that their treatment is too brief to do justice to 
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the formalism.) 

Among the most important quantities characterizing black holes are their mass 
and angular momentum, and the question arises as to how the mass and angular 
momentum of an isolated object is to be defined in general relativity. I find that 
the most compelling definitions come from the gravitational Hamiltonian, whose 
value for a given solution to the Einstein field equation depends on a specifiable 
vector field. If this vector corresponds to a time translation at spatial infinity, then 
the Hamiltonian gives the total mass of the spacetime; if, on the other hand, the 
vector corresponds to an asymptotic rotation about an axis, then the Hamiltonian 
gives the spacetime's total angular momentum in the direction of this axis. This 
connection is both deep and beautiful, and in this book it forms the starting point 
for defining black-hole mass and angular momentum. Chapter 4 of this book is 
devoted to a systematic treatment of the Lagrangian and Hamiltonian formulations 
of general relativity, with this goal in mind of arriving at well-motivated notions of 
mass and angular momentum. What sets my presentation apart from what can be 
found in other texts, including Misner, Thorne, and Wheeler and Wald, is that I pay 
careful attention to the "boundary terms" that must be included in the gravitational 
action to produce a well-posed variational principle. These boundary terms have 
been around for a very long time, but it is only fairly recently that their importance 
has been fully recognized. In particular, they are directly involved in defining the 
mass and angular momentum of an asymptotically-flat spacetime. 

To set the stage, I review the fundamentals of differential geometry in Chapter 
1 of this book. The collection of topics is standard: vectors and tensors, covari- 
ant differentiation, geodesies, Lie differentiation, Killing vectors, curvature tensors, 
geodesic deviation, and a few others. The goal here is not to provide an introduction 
to these topics; although some may be new, I assume that for the most part, the 
student will have encountered them before (in an introductory course at the level 
of Schutz, for example). Instead, my objective with this Chapter is to refresh the 
student's memory and establish the style and notation that I use throughout the 
book. 

As I have indicated, I have tried to present this material as a unified whole, 
using a consistent notation and maintaining a fairly uniform level of precision and 
rigour. While I have tried to be somewhat precise and rigourous, I have deliberately 
avoided putting too much emphasis on this. My attitude is that it is more important 
to illustrate how a theorem works and can be used in a practical situation, than 
it is to provide all the fine print that goes into a rigourous proof. The proofs that 
I do provide are informal; they may sometimes be incomplete, but they should be 
sufficient to convince the student that the theorems are true. They may, however, 
leave the student wanting for more; in this case I shall have to refer her to a more 
authoritative text such as Wald. 

I have also indicated that I wanted this book to be practical — I hope that after 
studying this book, the student will be able to use what she has learned to compute 
things of direct relevance to her. To help with this purpose I have inserted a large 
number of examples within the text. I also provide problem sets at the end of each 
chapter; here the student's understanding will be put to the test. The problems 
vary in difficulty, from the plug-and-grid type designed to make the student familiar 
with a new technique, to the more challenging type that is supposed to make the 
student think. Some of the problems require a large amount of tensor algebra, 
and I strongly encourage the student to let the computer perform the most routine 
operations. (My favourite package for tensor manipulations is GRTensorll, available 
free of charge at http : / / grtensor . phy . queensu . ca/.) 

Early versions of this book have been used by graduate students who took my 
course over the years. A number of them have expressed great praise by involving 
some of the techniques covered here in their own research. This is extraordinarily 



xii 



Preface 



gratifying, and it has convinced me that a wider release of this book might do more 
than just service my vanity. A number of students have carefully checked through 
the manuscript for errors (typographical or otherwise), and some have made useful 
suggestions for improvements. For this I thank Daniel Bruni, Sean Crowe, Luis de 
Menezes, Paul Kobak, Karl Martel, Sanjeev Seahra, and Katrin Rohlf. Of course, I 
accept full responsibility for whatever errors remain. The reader is invited to report 
any error she may find (poisson@physics.uoguelph.ca), and can look up those 
already reported at http://www.physics.uoguelph.ca/poisson/book/. 

This book is dedicated to Werner Israel, my teacher, mentor, and friend, whose 
influence on me, both as a relativist and as a human being, runs deep. His influence, 
I trust, will be felt throughout the book. Every time I started the elaboration of 
a new topic I would ask myself: "How would Werner approach this?". I do not 
believe for one second that the answers I came up with would even come close to 
his level of pedagogical excellence, but there is no doubt that to ask the question 
has made me try harder to reach that level. 
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We use the sign conventions of Misner, Thorne, and Wheeler (1973), with a metric 
of signature (—1, 1, 1, 1), a Riemann tensor defined by R a pjg = ^ a ps n + ■ ■ ■, and a 
Ricci tensor defined by R a p = R^^p- Greek indices (a, (3, . . . ) run from to 3, 
lower-case latin indices (a, b, ... ) run from 1 to 3, and upper-case latin indices (A, 
B, . . . ) run from 2 to 3. Geometrized units, in which G = c = 1, are employed. 
Here's a list of frequently occurring symbols: 



Symbol 



Description 



y a 



dx a /dy a , e a A = dx a /88 y 



9a/3 
h a b 

oab = g a pe A e" B 
g, h, a 

A {aP) =|(-4a/3 + Ap a ) 

A [aP\ = 2 ( A <*P ~ A P<*) 

pa 

1 Pi 

1 6c 

RaP-yS, RaP, R 
Rabcdi Rabi R 
V>,a = 9 a 1p 
1p,a = d a 1p 

A a p = VpA a 
A^ b = D b A a 
£ u A a 

d£ M = £ Ma/ 3 7 efe^ej d 3 y 

dSft, v = EjxvaP 6" e 3 d 2 9 

n a 

£ = n a n a 

K ab = n a -p e£ef 

6, a a p, w a p 

dVi 2 = d8 2 + sin 2 6 d<t> 2 



Arbitrary coordinates on manifold Jt 

Arbitrary coordinates on hypersurface £ 

Arbitrary coordinates on two-surface S 

Equals in specified coordinates 

Holonomic basis vectors 

Orthonormal basis vectors 

Metric on M 

Induced metric on S 

Induced metric on S 

Metric determinants 

Symmetrization 

Antisymmetrization 

Christoffel symbols constructed from g a p 

Christoffel symbols constructed from /i (, 

As constructed from g a p 

As constructed from /i f, 

Partial differentiation with respect to x a 

Partial differentiation with respect to y a 

Covariant differentiation ((^-compatible) 

Covariant differentiation (/i f,-compatible) 

Lie derivative of A" along u a 

Killing vector: £^g a p = 

Permutation symbol 

Levi-Civita tensor 

Directed surface element on £ 

Directed surface element on S 

Unit normal on £ (if timelike or spacelike) 

+1 if £ is timelike, — 1 if £ is spacelike 

Extrinsic curvature of £ 

Expansion, shear, and rotation 

Line element on unit two-sphere 
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Chapter 1 
Fundamentals 



This first chapter is devoted to a brisk review of the fundamentals of differential 
geometry. The collection of topics presented here is fairly standard, and most of 
these topics should have been encountered in a previous introductory course on 
general relativity. Some, however, may be new, or may be treated here from a 
different point of view, or with an increased degree of completeness. 

We begin in Sec. 1.1 by providing definitions for tensors on a differentiable 
manifold. The point of view adopted here, and throughout the text, is entirely 
unsophisticated: We do without the abstract formulation of differential geometry 
and define tensors in the old-fashioned way, in terms of how their components 
transform under coordinate transformations. While the abstract formulation (in 
which tensors are defined as multilinear mappings of vectors and dual vectors into 
real numbers) is decidedly more elegant and beautiful, and should be an integral part 
of an education in general relativity, the old approach has the advantage of economy, 
and this motivated its adoption here. Also, the old-fashioned way of defining tensors 
produces an immediate distinction between tensor fields in spacetime (four-tensors) 
and tensor fields on a hypersurface (three-tensors); this distinction will be important 
in later chapters of this book. 

Covariant differentiation is reviewed in Sec. 1.2, Lie differentiation in Sec. 1.4, 
and Killing vectors are introduced in Sec. 1.5. In Sec. 1.3 we develop the mathemat- 
ical theory of geodesies. The theory is based on a variational principle and employs 
an arbitrary parameterization of the world line. The advantage of this approach 
(over one in which geodesies are defined by parallel transport of the tangent vector) 
is that the limiting case of null geodesies can be treated more naturally. Also, it is 
often convenient, especially with null geodesies, to use a parameterization that is 
not affine; we will do so in later portions of this book. 

In Sec. 1 .6 we review a fundamental theorem of differential geometry, the local 
flatness theorem. Here we prove the theorem in the standard way, by counting the 
number of functions required to go from an arbitrary coordinate system to a locally 
Lorentzian frame. In Sec. 1.11 we extend the theorem to an entire geodesic, and we 
prove it by erecting Fermi normal coordinates in a neighbourhood of this geodesic. 

Useful results involving the determinant of the metric tensor are derived in 
Sec. 1.7. The metric determinant is used in Sec. 1.8 to define the Levi-Civita tensor, 
which will be put to use in later parts of this book (most notably in Chapter 3). 
The Riemann curvature tensor and its contractions are introduced in Sec. 1.9, along 
with the Einstein field equations. The geometrical meaning of the Riemann tensor 
is explored in Sec. 1.10, in which we derive the equation of geodesic deviation. 
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Fundamentals 



1.1 Vectors, dual vectors, and tensors 

Consider a curve 7 on a manifold. The curve is parameterized by A and is described 
in an arbitrary coordinate system by the relations x a (X). We wish to calculate the 
rate of change of a scalar function f(x a ) along this curve: 

tf_dldx^_ 

dX ~ dx" d\ ~ ha ■ 

This procedure allows us to introduce two types of objects on the manifold: u a = 
dx a /d\ is a vector which is everywhere tangent to 7, and /, a = df/dx a is a dual 
vector, the gradient of the function /. These objects transform as follows under an 
arbitrary coordinate transformation from x a to x a : 

t df df dx a _ dx a 

1 J,a 



J ' dx a ' dx a dx a ' dx a 
and 

„. dx a ' dx a ' dx a dx c 
u = 



dX dx a dX dx a 

From these equations we recover the fact that df /dX is an invariant: f >a 'U a ' = f >a u a . 
Any object A a which transforms as 

A a = ^_ A a (1.1.1) 

dx a v ; 

under a coordinate transformation will be called a vector: On the other hand, any 
object p a which transforms as 

dx a 

Pa ' = dx^ Pa (LL2) 

under the same coordinate transformation will be called a dual vector. The con- 
traction A a p a between a vector and a dual vector is invariant under the coordinate 
transformation, and is therefore a scalar. 

Generalizing these definitions, a tensor of type (n,ra) is an object T a "' /3 7 ... (5 
which transforms as 



' dx a dxP dx-y' dx 5 ' 



T a " s (1.1.3) 



under a coordinate transformation. The integer n is equal to the number of super- 
scripts, while m is equal to the number of subscripts. It should be noted that the 
order of the indices is important; in general, T /3 "' a 7 ... (5 ^ J 1 "'"' 3 g . By definition, 
vectors are tensors of type (1, 0), and dual vectors are tensors of type (0, 1). 

A very special tensor is the metric tensor g a p, which is used to define the inner 
product between two vectors. It is also the quantity that represents the gravitational 
field in general relativity. The metric or its inverse g a/3 can be used to lower or raise 
indices. For example, A a = g a ^A^ and p a = g a ^pp. The inverse metric is defined 
by the relations g^g^p = 8 a p. The metric and its inverse are symmetric tensors. 

Tensors are not actually defined on the manifold itself. To illustrate this, con- 
sider the vector u a tangent to the curve 7, as represented in Fig. 1.1. The diagram 
makes it clear that the tangent vector actually "sticks out" of the manifold. In fact, 
a vector at a point P on the manifold is defined in a plane tangent to the manifold 
at that point; this plane is called the tangent plane at P. Similarly, tensors at a 
point P can be thought of as living in this tangent plane. Tensors at P can be 
added and contracted, and the result is also a tensor. However, a tensor at P and 
another tensor at Q cannot be combined in a tensorial way, because these tensors 
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Figure 1.1: A tensor at P lives in the manifold's tangent plane at P. 



belong to different tangent planes. For example, the operations A a {P)B l3 {Q) and 
A a {Q) — A a {P) are not defined as tensorial operations. This implies that differen- 
tiation is not a straightforward operation on tensors. To define the derivative of a 
tensor, a rule must be provided to carry the tensor from one point to another. 



1.2 Covariant differentiation 

One such rule is parallel transport. Consider a curve 7, its tangent vector u a , and 
a vector field A" defined in a neighbourhood of 7 (Fig. 1.2). Let point P on the 
curve have coordinates x a , and point Q have coordinates x a + dx". As was stated 
previously, the operation 

dA a = A a {Q)-A a {P) 

= A a (x 13 +dx' 3 ) - A a (x 13 ) 
= A a 3 dx f} 

is not tensorial. This is easily checked: under a coordinate transformation, 

.„< d dx"' _ dx"' dx 13 d 2 x"' dx 13 

' ~ "537 A Jj + „ „ a a ai A , 



• p dx? dx a dx a dxP' ' p dx a dxP dxP' 

which is not a tensorial transformation. To be properly tensorial, the derivative 
operator should have the form DA" = A%(P) — A a (P), where A%(P) is the vector 
that is obtained by "transporting" A a from Q to P. We may write this as DA a = 
dA" + SA a , where 5A a = A%(P) - A a (Q) is also not a tensorial operation. The 
precise rule for parallel transport must now be specified. We demand that 6A a be 
linear in both A" and dx 13 , so that 6A a = T a ^ A" dx 13 for some (nontensorial) field 
r a M/3 called the connection. A priori, this field is freely specifiable. 

We now have DA a = A a p dx p +T a ^ A^dx 13 , and dividing through by dX, the 
increment in the curve's parameter, we obtain 

f"V. (>•") 

where u 13 = dx 13 fd\ is the tangent vector, and 

A^^A^ + T^A". (1.2.2) 

This is the covariant derivative of the vector A a . Other standard notations are 
A a . p = VpA 01 and DA a /dX = V u A a . 
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Figure 1.2: Differentiation of a tensor. 



The fact that A a ^ is a tensor allows us to deduce the transformation property 
of the connection. Starting from T a pA 11 = A"^ — A a p, it is easy to show that 

Expressing A^' in terms of A^ on the left-hand side and using the fact that A^ is 
an arbitrary vector field, we obtain 

. dx"' _ dx a ' dx p d 2 x a ' dx? 



Multiplying through by dx" /dx 1 ' and rearranging the indices, we arrive at 

• _ dx"' dx 13 dx" d 2 x a ' dx 13 dx" 

v' 13 ' = 'dx^dx^ r dx^ r " p ~ dx"dxl 3 dx^ r dx^ r ' ( ' ' 

Covariant differentiation can be extended to other types of tensors by demanding 
that the operator D obey the product rule of differential calculus. (For scalars, it 
is understood that D = d.) For example, we may derive an expression for the 
covariant derivative of a dual vector from the requirement 

d(A a Pa ) ee D(A a Pa ) = (DA a ) Pa + A a D( Pa ). 

Writing the left-hand side as A a j3 p a dx' 3 + A a p a ^dx 13 and using Eqs. (1.2.1) and 
(1.2.2), we obtain 

where 

Pa;/3 = Pa,P ~ ^apPv (1.2.5) 

This procedure generalizes easily to tensors of arbitrary type. For example, the 
covariant derivative of a type-(l, 1) tensor is given by 

T%, = + r a „ 7 T£ - T^Tl . (1.2.6) 



The rule is that there is a connection term for each tensorial index; it comes with a 
plus sign if the index is a superscript, or with a minus sign if the index is a subscript. 
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Up to now the connection has been left completely arbitrary. A specific choice 
is made by demanding that it be symmetric and metric compatible, 

r a 7/ 3 = r Q £ 7 , 5a/3; 7 = 0. (1-2.7) 

In general relativity, these properties come as a consequence of Einstein's principle 
of equivalence. It is easy to show that Eqs. (1-2.7) imply 

1^ = \ 9 a » (<W,7 + W - 9fr,v) • (1-2-8) 

Thus, the connection is fully determined by the metric. In this context, T a p are 
called the Christoffel symbols. 

We conclude this section with some terminology: A tensor field T a " is said 
to be parallel transported along a curve 7 if its covariant derivative along the curve 
vanishes: DT a - p ..J dX = V" - = 0. 

1.3 Geodesies 

A curve is a geodesic if it extremizes the distance between two fixed points. 

Let a curve 7 be described by the relations x a (X), where A is an arbitrary 
parameter, and let P and Q be two points on this curve. The distance between P 
and Q along 7 is given by 

£ = ^j±g aP x<*xP dX, (1.3.1) 

where x a = dx a jd\. In the square root, the positive (negative) sign is chosen if the 
curve is spacelike (timelike); it is assumed that 7 is nowhere null. It is clear that I 
is invariant under a reparameterization of the curve, A — > A' (A). 

The curve for which I is an extremum is determined by substituting the "La- 
grangian" I/(x' i ,x' i ) = {±g IJlV x ,l x v ) 1 / 2 into the Euler-Lagrange equations, 

d dL 01. 
dXdi" ~ dx" 

A straightforward calculation shows that x a (A) must satisfy the differential equation 
'x a + r a /37 x /3 x 7 = fc(A)x" (arbitrary parameter), (1.3.2) 

where k = dlnL/dX. The geodesic equation can also be written as u a .pvP = ku", 
in which u a = x a is tangent to the geodesic. 

A particularly useful choice of parameter is proper time r when the geodesic 
is timelike, or proper distance s when the geodesic is spacelike. (It is important 
that this choice be made after extremization, and not before.) Because dr 2 = 
—g a pdx a dx 13 for timelike geodesies and ds 2 = g a pdx a dx 13 for spacelike geodesies, 
we have that L = 1 in either case, and this implies k = 0. The geodesic equation 
becomes 

'x a + r a /37 i /3 x 7 = (affine parameter), (1.3.3) 

or u a ^vP = 0, which states that the tangent vector is parallel transported along 
the geodesic. These equations are invariant under reparameterizations of the form 
A — > A' = aX + b, where a and b are constants. Parameters related to s and r by such 
transformations are called affine parameters. It is useful to note that Eq. (1.3.3) 
can be recovered by substituting L' = \g a ^x a x^ into the Euler-Lagrange equations; 
this gives rise to practical method of computing the Christoffel symbols. 
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By continuity, the general form = ku" for the geodesic equation must 

be valid also for null geodesies. For this to be true, the parameter A cannot be 
affine, because ds = dr = along a null geodesic, and the limit is then singular. 
However, affine parameters can nevertheless be found for null geodesies. Starting 
from Eq. (1.3.2) it is always possible to introduce a new parameter A* such that 
the geodesic equation will take the form of Eq. (1.3.3). It is easy to check that the 
appropriate transformation is 




(1.3.4) 



(You will be asked to provide a proof of this statement in Sec. 1.13, Problem 2.) It 
should be noted that while the null version of Eq. (1.3.2) was obtained by a limiting 
procedure, the null version of Eq. (1.3.3) cannot be considered to be a limit of the 
same equation for timelike or spacelike geodesies: the parameterization is highly 
discontinuous. 

We conclude this section with the following remark: Along an affinely param- 
eterized geodesic (timelike, spacelike, or null), the scalar quantity e = u a u a is a 
constant. The proof requires a single line: 

ds 

— = {u a u a ). tP u p = (u a . /3 u /3 )u a +u a (u a . / 3u' 3 ) = 0. 

If proper time or proper distance is chosen for A, then e = respectively. For a 
null geodesic, e = 0. 



1.4 Lie differentiation 

In Sec. 1.2, covariant differentiation was defined by introducing a rule to transport 
a tensor from a point Q to a neighbouring point P, at which the derivative was 
to be evaluated. This rule involved the introduction of a new structure on the 
manifold, the connection. In this section we define another type of derivative — 
the Lie derivative — without introducing any additional structure. 

Consider a curve 7, its tangent vector u a = dx a /dX, and a vector field A a 
defined in a neighbourhood of 7 (Fig. 1.2). As before, the point P shall have the 
coordinates x a , while the point Q shall be at x a + dx a . The equation 

x' a =x a + dx a = x a + u a dX 

can be interpreted as an infinitesimal coordinate transformation from the system x 
to the system x'. Under this transformation, the vector A a becomes 

A ' a{x>) = W A " {X) 

= ^ a p + u a f}d\)A p {x) 
= A a (x) + u a /3 A l3 (x)dX. 

In other words, 

A' a (Q) = A a (P) + u a p A p {P) dX. 

On the other hand, A a (Q), the value of the original vector field at the point Q, can 
be expressed as 

A a (Q) = A a (x+dx) 

= A a (x)+A a p (x)dx 13 
= A a (P)+u l3 A a p (P)dX. 
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In general, A' a (Q) and A a (Q) will not be equal. Their difference defines the Lie 
derivative of the vector A a along the curve 7: 

£ A a (p ,- A<*(Q)-A«*(Q) 
LuA {n ~ dX • 

Combining the previous three equations yields 

£ u A a = A a p u p - u a p A p . (1.4.1) 

Despite an appearance to the contrary, £ u A a is a tensor: It is easy to check that 
Eq. (1.4.1) is equivalent to 

£ u A a = A^u 13 - u^A 13 , (1.4.2) 

whose tensorial nature is evident. 

The definition of the Lie derivative extends to all types of tensors. For scalars, 
£ u f = df = f, a u a . For dual vectors, the same steps reveal that 

£uPa = PcPU 13 + V? a pp 

(1.4.3) 

= Pa-.pU 13 +U 13 [pp. 



As another example, the Lie derivative of a type-(l, 1) tensor is given by 

(1.4.4) 



J? rpa _ rpa II _ a rpfl , U rpa 



- 1 j3;n u u in 1 p+ u -S 1 n- 

Further generalizations are obvious. It may be verified that the Lie derivative obeys 
the product rule of differential calculus. For example, the relation 

£ u (A a pp) = (£ u A a )pp + A a (£ uP p) (1.4.5) 

is easily established. 

A tensor field T a "p... is said to be Lie transported along a curve 7 if its Lie 
derivative along the curve vanishes: £ u T a '" j3 ... = 0, where u a is the curve's tangent 
vector. Suppose that the coordinates are chosen so that x 1 , x 2 , and x 3 are all 
constant on 7, while x° = X varies on 7. In such a coordinate system, 

U ~ dX 

where the symbol "=" means "equals in the specified coordinate system" . It follows 
that u a p = 0, so that 

n rpa--- rpa--- M ^ rpa--- 

&u-L P---L p-,„U - —1 p.... 

If the tensor is Lie transported along 7, then the tensor's components are all inde- 
pendent of x° in the specified coordinate system. 
We have established the following theorem: 

If £ u T a "' j3 ... = 0, that is, if a tensor is Lie transported along a curve 7 with 
tangent vector u a , then a coordinate system can be constructed such that u a = 5 a 
and T a "p... = 0. Conversely, if in a given coordinate system the components of a 
tensor do not depend on a particular coordinate x°, then the Lie derivative of the 
tensor in the direction of u a vanishes. 

Thus, the Lie derivative is the natural construct to express, covariantly, the 
invariance of a tensor under a change of position. 
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1.5 Killing vectors 

If, in a given coordinate system, the components of the metric do not depend on 
a; , then by the preceding theorem, £^g a /3 = 0, where £ a = 6 a . The vector £ a is 
then called a Killing vector. The condition for £ a to be a Killing vector is that 

= £^g a p = £a;P +£/3;a- (1.5.1) 

Thus, the tensor £ a -p is antisymmetric if is a Killing vector. 

Killing vectors can be used to find constants associated with the motion along 
a geodesic. Suppose that u a is tangent to a geodesic affinely parameterized by A. 
Then 

= 0. 

In the second line, the first term vanishes by virtue of the geodesic equation, and 
the second term vanishes because ^ a -p is an antisymmetric tensor and u a u^ is 
symmetric. Thus, u a ^ a is constant along the geodesic. 

As an example, consider a static, spherically symmetric spacetime with metric 

ds 2 = -A(r) dt 2 + B(r) dr 2 + r 2 dfi 2 , 

where d£l 2 = d6 2 + sin 2 6d(f> 2 . Because the metric does not depend on t nor (f>, the 
vectors 

_ dx a _ dx a 

kt) - m > «w) - d(j> 

are Killing vectors. This implies that along timelike geodesies, the quantities 

E = -« a ((t)j L = UaZfy 

are constants. These can be interpreted as energy and angular momentum per unit 
mass, respectively. It should also be noted that spherical symmetry implies the 
existence of two additional Killing vectors, 

€(i)®a = sm (f>d$ + cot 9 cos <j> d<p, £(2)9 a = — cos <j> do + cot 6 sin cf> 0$. 

It is straightforward to show that these do indeed satisfy Killing's equation (1.5.1). 
(To prove this is the purpose of Sec. 1.13, Problem 5.) 

1.6 Local flatness 

For a given point P in spacetime, it is always possible to find a coordinate system 
x a such that 

top (P) = Va-p- , r«;, 7 , (P) = 0, (1.6.1) 

where Jj a '/3' = diag(— 1, 1, 1, 1) is the Minkowski metric. Such a coordinate system 
will be called a local Lorentz frame at P. We note that it is not possible to also 
set the derivatives of the connection to zero if the spacetime is curved. The phys- 
ical interpretation of the local-flatness theorem is that free-falling observers see no 
effect of gravity in their immediate vicinity, as required by Einstein's principle of 
equivalence. 
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We now prove the theorem. Let x" be an arbitrary coordinate system, and let 
us assume, with no real loss of generality, that P is at the origin of both coordinate 
systems. Then the coordinates of a point near P are related by 

x a ' = A a ' pX P + 0(x 2 ), x a = A a p lX P' + 0(x' 2 ), 

where A a p and A a p, are constant matrices. It is easy to check that one is in fact 
the inverse of the other: 

ja' /1M _ X<*' A a A^' — A a 

A n A p< — P'> A n' A p — p- 
Under this transformation, the metric becomes 

g a ,p,(P)=A a a ,A^,g aP (P). 

We demand that the left-hand side be equal to f] a <p<- This gives us 10 equations for 
the 16 unknown components of the matrix A a a , . A solution can always be found, 
with 6 undetermined components. This corresponds to the freedom of performing a 
Lorentz transformation (3 rotation parameters and 3 boost parameters) which does 
not alter the form of the Minkowski metric. 

Suppose that a particular choice has been made for A a a ,. Then A a a is found 
by inverting the matrix, and the coordinate transformation is known to first order. 
Let us proceed to second order: 

x a ' = A a 'pX p + i B^x^x 1 + 0(x 3 ), 

where the constant coefficients B a p are symmetric in the lower indices. Recalling 
Eq. (1.2.3), we have that the connection transforms as 

r a /3' 7 '(-P) = A^A^p, A 1 J ,r a p 1 (P) - B a j3l A l3 p l A' 1 1 ,. 
To put the left-hand side to zero, it is sufficient to impose 

B a pn, = -A a a T a p 1 (P). 

These equations determine -B°g 7 uniquely, and the coordinate transformation is 
now known to second order. Irrespective of the higher-order terms, it enforces 
Eqs. (1.6.1). 

We shall return in Sec. 1.11 with a more geometric proof of the local- flatness 
theorem, and its extension from a single point P to an entire geodesic 7. 

1.7 Metric determinant 

The quantity y/^g, where g = det[g a p], occurs frequently in differential geometry. 
We first note that \J g' /g, where g' = det[g a >p>], is the Jacobian of the transforma- 
tion x a — > x a (x a ). To see this, recall from ordinary differential calculus that under 
such a transformation, d A x = J d 4 x', where J = det[dx a /dx a ] is the Jacobian. 
Now consider the transformation of the metric, 

dx a dx 13 
9a ' P ' = dx* dxf>' 9aP - 

Because the determinant of a product of matrices is equal to the product of their 
determinants, this equation implies g' = gJ 2 , which proves the assertion. 

As an important application, consider the transformation from x a , a local 
Lorentz frame at P, to x a , an arbitrary coordinate system. The four-dimensional 
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volume element around P is d 4 x' = J 1 d 4 x = yjg/g' d 4 x. But since g' = —1, we 
have that 

^/=gd 4 x (1.7.1) 

is an invariant volume element around the arbitrary point P. This result generalizes 
to a manifold of any dimension with a metric of any signature; in this case, \g\ 1 / 2 d n x 
is the invariant volume element, where n is the dimension of the manifold. 
We shall now derive another useful result, 

r% = l^g^a = ^=(V=»), a - (1-7.2) 

Consider, for any matrix M, the variation of ln|detAf| induced by a variation of 
iVf's components. Using the product rule for determinants, we have 

<51n|detM| = In |det(M + 6M)\ - In |detM| 
det(M + 6M) 
n detM 
= IndetM' 1 (M + SM) 
= lndet(l + M _1 <5M). 

We now use the identity det(l +e) = 1 +Tre + 0(e 2 ), valid for any "small" matrix 
e. (Try proving this for 3 x 3 matrices.) This gives 

<51n|detM| = ln(l + Tr M~ X 6M) 
= Tr M~ l 5M. 

Substituting the metric tensor in place of M gives <51n \g\ = g a/3 6g a p, or 

^ In |s | =g afi g a p tlt . 

This establishes Eq. (1.7.2). 

Equation (1.7.2) gives rise to the divergence formula: For any vector field A a , 

^ = 7=(^^),a- (1-7.3) 

A similar result holds for any antisymmetric tensor field B al3 : 

These formulae are useful for the efficient computation of covariant divergences. 

1.8 Levi-Civita tensor 

The permutation symbol [afl-yS], defined by 

{+1 if afl-yS is an even permutation of 0123 
— 1 if aP^/S is an odd permutation of 0123 , (1.8.1) 
if any two indices are equal 

is a very useful, non-tensorial quantity. For example, it can be used to give a 
definition for the determinant: For any 4x4 matrix M a p, 

det[M aj3 ] = [a fi-y S]M 0a M 1/3 M 2l M 3S 

(1.8.2) 

= [a/3'y5]M a0 M /n M 1 2M S3 . 
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Either equality can be established by brute-force computation. The well-known 
property that det[Mg a ] = det[M a/ g] follows directly from Eq. (1.8.2). 
We shall now show that the combination 

£«/3 7 <5 = V-giafllS] (1.8.3) 
is a tensor, called the Levi-Civita tensor. Consider the quantity 

,dx a dx? dx 1 dx 5 



[aP-yS] 



dx a ' dx?' dx-y' dx 6 ' ' 



which is completely antisymmetric in the primed indices. This must therefore be 
proportional to [a' P' 7' 5']: 

for some proportionality factor A. Putting a' P'^'5' = 0123 yields 

, dx a dx 13 dx 1 dx 5 



[a Pi 5} 



dx ' dx 1 ' dx 2 ' dx 3 ' 



which determines A. But the right-hand side is just the determinant of t he m atrix 
dx a /dx a , that is, the Jacobian of the transformation x a (x a ). So A = \/g'/g, and 
we have 

dx a dx 13 dx 1 dx 5 



^ a ^ s ^dWdxV^ = v-9'WP','n 



This establishes the fact that Sap-yd does indeed transform as a type-(0,4) tensor. 
The preceding proof could have started instead with the relation 

L 1 dx a dxP dx"< dx 5 L 1 



implying A' = \J gjg' and showing that 



[a Pi 5] (1.8.4) 



transforms as a type-(4,0) tensor. (The minus sign is important.) It is easy to 
check that this is also the Levi-Civita tensor, obtained from e a p 7 s by raising all 
four indices. Alternatively, we may show that £ a p-y6 = gaiidPvg-txg&pZ^^ ■ This 
relation implies 

£0123 = - -y= [a* v a p]9ov,9ivg2\g3 P = - -y= g = V~9, 

which is evidently compatible with Eq. (1.8.3). 

The Levi-Civita tensor is used in a variety of contexts in differential geometry. 
We will meet it again in Chapter 3. 

1.9 Curvature 

The Riemann tensor R a p 7 g may be defined by the relation 

^., ap -A» pa = -Ri* vap A", (1.9.1) 
which holds for any vector field A a . Evaluating the left-hand side explicitly yields 

R a p-ys = r% j7 - r a £ 7>(5 + r^r^pg - r^r^. (1.9.2) 
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The Riemann tensor is obviously antisymmetric in the last two indices. Its other 
symmetry properties can be established by evaluating in a local Lorentz frame 
at some point P. A straightforward computation gives 

Rap-yS = ^ {9ad,p-y ~ 9a-y,P5 ~ 9p5,a-y + 9f3-y,as), 

and this implies the tensorial relations 

Rafi-yS = —Rfia-yS = —Ra/36-y = R-ySa/3 (1.9.3) 

and 

Ruafi-y + Rn-yafi + R^-ya = 0, (1.9.4) 

which are valid in any coordinate system. A little more work along the same lines 
reveals that the Riemann tensor satisfies the Bianchi identities, 

R^vaf3;"f R/Myya;@ R/M'fl'y;a 0* (1.9.5) 

In addition to Eq. (1.9.1), the Riemann tensor satisfies the relations 

~ Putfoc = R V m fiVv (1-9-6) 

and 

T% aP ~ T% a = ~R\ ap T\ + R\ af) T\, (1.9.7) 

which hold for arbitrary tensors p a and T°p. Generalization to tensors of higher 
ranks is obvious: the number of Riemann-tensor terms on the right-hand side is 
equal to the number of tensorial indices. 

Contractions of the Riemann tensor produce the Ricci tensor R a p and the Ricci 
scalar R. These are defined by 

= fl" awfJ) R = R a a - (1-9.8) 

It is easy to show that R a p is a symmetric tensor. The Einstein tensor is defined 
by " 

G a = Ra/3 - ^ R9aP~, (1.9.9) 

this is also a symmetric tensor. By virtue of Eq. (1.9.5), the Einstein tensor satisfies 

G a/3 . p = 0, (1.9.10) 

the contracted Bianchi identities. 
The Einstein field equations, 

G afi =8nT a P, (1.9.11) 

relate the spacetime curvature (as represented by the Einstein tensor) to the dis- 
tribution of matter (as represented by T a/3 , the stress-energy tensor). Equation 
(1.9.10) implies that the stress-energy tensor must have a zero divergence: T" 13 ^ = 
0. This is the tensorial expression for energy-momentum conservation. Equation 
(1.9.10) implies also that of the ten equations (1.9.11), only six are independent. 
The metric can therefore be determined up to four arbitrary functions, and this 
reflects our complete freedom in choosing the coordinate system. We note that the 
field equations can also be written in the form 

RoP = 8n(T a/3 - i Tg a ^ , (1.9.12) 



where T = T a a is the trace of the stress-energy tensor. 
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1.10 Geodesic deviation 

The geometrical meaning of the Riemann tensor is best illustrated by examining 
the behaviour of neighbouring geodesies. Consider two such geodesies, 70 and 71, 
described by relations x a (t) in which t is an affine parameter; the geodesies can 
be either spacelike, timelike, or null. We want to develop the notion of a deviation 
vector between these two geodesies, and derive an evolution equation for this vector. 

For this purpose we introduce, in the space between 70 and 71 , an entire family of 
interpolating geodesies (Fig. 1.3). To each geodesic we assign a label s € [0, 1], such 
that 70 comes with the label s = and 71 with s = 1. We collectively describe these 
geodesies with relations x a (s,t), in which s serves to specify which geodesic and t 
is an affine parameter along the specified geodesic. The vector field u a = dx a jdt is 
tangent to the geodesies, and it satisfies the equation u a .^vP = 0. 

If we keep t fixed in the relations x a (s, t) and vary s instead, we obtain another 
family of curves, labelled by t and parameterized by s; in general these curves will 
not be geodesies. The family has £ a = dx a /ds as its tangent vector field, and the 
restriction of this vector to 70, £ a | s =o, gives a meaningful notion of a deviation 
vector between 70 and 71 . We wish to derive an expression for its acceleration, 

EE (1-10-1) 

in which it is understood that all quantities are to be evaluated on 70. In flat 
spacetime, the geodesies 70 and 71 are straight, and although their separation may 
change with t, this change is necessarily linear: D 2 £ a /dt 2 = in flat spacetime. 
A nonzero result for D 2 £ a /dt 2 will therefore reveal the presence of curvature, and 
indeed, this vector will be found to be proportional to the Riemann tensor. 
It follows at once from the relations u a = dx a jdt and £ a = dx a /ds that 

£ „r = £?w a = => = u a .^. (1.10.2) 

We also have at our disposal the geodesic equation, u a ^uP = 0. These equations 
can be combined to prove that £ a u a is constant along 70 : 

= \{u a u a ), e 
= 0, 




Figure 1.3: Deviation vector between two neighbouring geodesies. 
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because u a u a = e is a constant. The parameterization of the interpolating geodesies 
can therefore be tuned so that on -y , £ a is everywhere orthogonal to u a : 

£ a u a = 0. (1.10.3) 

This means that the curves t = constant cross 70 orthogonally. This adds weight 
to the interpretation of as a deviation vector. 

We may now calculate the relative acceleration of 71 with respect to 70. Starting 
from Eq. (1.10.1) and using Eqs. (1.9.1) and (1.10.2), we obtain 

(OAt^ 7 
u a . f) ^u>+u a . f) Z%u> 

The first term vanishes by virtue of the geodesic equation, while the second and 
fourth terms cancel out, leaving 

= -R% s u^Cu 6 . (1.10.4) 

This is the geodesic deviation equation. It shows that curvature produces a rela- 
tive acceleration between two neighbouring geodesies; even if they start parallel, 
curvature prevents the geodesies from remaining parallel. 



dt 2 



1.11 Fermi normal coordinates 

The proof of the local-flatness theorem presented in Sec. 1.6 gives very little in- 
dication as to how one might construct a coordinate system that would enforce 
Eqs. (1.6.1). Our purpose in this section is to return to this issue, and provide a 
more geometric proof of the theorem. In fact, we will extend the theorem from a 
single point P to an entire geodesic 7. For concreteness we will take the geodesic 
to be timelike. 

We will show that we can introduce coordinates x a = (t, x a ) such that near 7, 
the metric can be expressed as 

g tt = -1-Rt a tb(t)x a x b +0(x 3 ), 

9ta = -^Rtbac(t)x b X C +0(X 3 ), (1.11.1) 
9ab = Sab- ^R ac bd(t)x C X d + 0(X 3 ). 

These coordinates are known as Fermi normal coordinates, and t is proper time 
along the geodesic 7, on which the spatial coordinates x a are all zero. In Eq. (1.11.1), 
the components of the Riemann tensor are evaluated on 7, and they depend on t 
only. It is obvious that Eq. (1.11.1) enforces g a p\-y = V<*p an d r M a/3 | 7 = 0. The 
local-flatness theorem therefore holds everywhere on the geodesic. 

1.11.1 Geometric construction 

We will use x a = (f , x a ) to denote the Fermi normal coordinates, and x a ' will refer 
to an arbitrary coordinate system. We imagine that we are given a spacetime with 
a metric g a 'p' expressed in these coordinates. 
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We consider a timelike geodesic 7 in this spacetime. Its tangent vector is u a , 
and we let t be proper time along 7. On this geodesic we select a point O at which 
we set t = 0. At this point we erect an orthonormal basis e£ (the subscript n 
serves to label the four basis vectors), and we identify e"' with the tangent vector 
u a at O. From this we construct a basis everywhere on 7 by parallel transporting 

away from O. Our basis vectors therefore satisfy 

e% p ,uP' =0, ef'=u a ', (1.11.2) 

as well as 

3a'/3<e£'ef = rj^, (1.11.3) 

everywhere on 7. Here, ?j M „ = diag(— 1, 1, 1, 1) is the Minkowski metric. 

Consider now a spacelike geodesic p originating at a point P on 7, at which 
t = tp. This geodesic has a tangent vector v a , and we let s denote proper distance 
along /3; we set s = at P. We assume that at P, v a is orthogonal to u a , so that 
it admits the decomposition 

v a '\^ = Sl a ^'. (1.11.4) 

To ensure that v a ' is properly normalized, the expansion coefficients must satisfy 
5 o( ,fi°fi 6 = 1. By choosing different coefficients fi° we can construct new geodesies 
(3 that are also orthogonal to 7 at P. We shall denote this entire family of spacelike 
geodesies by /3(£p,f2°). 

The Fermi normal coordinates of a point Q located off the geodesic 7 are con- 
structed as follows (Fig. 1.4). First we find the unique geodesic that passes through 
Q and intersects 7 orthogonally. We label the intersection point P, and we call this 
geodesic /3(ip,f2g), with tp denoting proper time at the intersection point, and 

fig the expansion coefficients of v a ' at that point. We then assign to Q the new 
coordinates 

x° = t P , x a = n a Q s Q , (1.11.5) 

where sq is proper distance from P to Q. These are the Fermi normal coordinates 
of the point Q. Generically, therefore, x a = (t,£l a s), and we must now figure out 
how these coordinates are related to x a , the original system. 

1.11.2 Coordinate transformation 

For this purpose, we note first that we can describe the family of geodesies fl(t, fi°) 
by relations of the form x a (t,£l a ,s). In these, the parameters t and fi° serve to 




Figure 1.4: Geometric construction of the Fermi normal coordinates. 
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specify which geodesic, and s is proper distance along this geodesic. If we substitute 
s = in these relations, we recover the description of the timelike geodesic 7 in 
terms of its proper time t; the parameters Q a are then irrelevant. The tangent to 
the geodesies /3(f , fi°) is 

f dx a ' 



v = 



ds 



(1.11.6) 



the notation explicitly indicates that the derivative with respect to s is taken while 
keeping t and fi° fixed. This vector satisfies the geodesic equation and is subjected 
to the initial condition v a | s=0 = n a e" . But the geodesic equation is invariant 
under a rescaling of the affine parameter, s — > s/c, in which c is a constant. Under 
this rescaling, v a ->c«" and as a consequence, we have that fi° — > cQ a . We have 
therefore established the identity x a (t,£l a ,s) = x a (t,c£l a ,s/c), and as a special 
case, we find 

x a ' (t, Sl a , s) = x a ' (t, 1) = x a ' (x a ). (1.11.7) 

By virtue of Eqs. (1.11.5), this relation is the desired transformation between x a ' 
and the Fermi normal coordinates. 

Now, as a consequence of Eqs. (1.11.4), (1.11.6), and (1.11.7), we have 



dx a 



ds 



dx a 



s=0 



dx a 



s=0 



which shows that 



dx a 



dx a 



= e„ 



(1.11.8) 



From our previous observation that the relations x a (t, fi°, 0) describe the geodesic 
7, we also have 

Br a ' 

' 7 ■ (l-H-9) 



dt 



= u = e+ 



Equations (1.11.8) and (1.11.9) tell us that on 7, dx a ' /dx» = 



1.11.3 Deviation vectors 

Suppose now that in the relations x a ' (t, fi°, s), the parameters fi° are varied while 
keeping t and s fixed. This defines new curves that connect different geodesies /3 at 
the same proper distance s from their common intersection point P on 7. This is 
very similar to the construction described in Sec. 1.10, and the vectors 



e = (i.n.i<» 



are deviation vectors relating geodesies /3(f, fi°) with different coefficients fi°. Sim- 
ilarly, 



is a deviation vector relating geodesies fl(t, fi°) that start at different points on 7, 
but share the same coefficients fi°. The four vectors defined by Eqs. (1.11.10) and 
(1.11.11) satisfy the geodesic deviation equation, Eq. (1.10.4). (It must be kept 
in mind that in this equation, the tangent vector is v a , not u a , and the affine 
parameter is s, not t.) 
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1.11.4 Metric on 7 

The components of the metric in the Fermi normal coordinates are related to the 
old components by the general relation 

dx a ' dx 13 ' 

t ol 

Evaluating this on 7 yields g^l-y = e^J g a 'P', after using Eqs. (1.11.8) and 
(1.11.9). Substituting Eq. (1.11.3), we arrive at 

5a/3| 7 = Va/3- (1.11.12) 

This states that in the Fermi normal coordinates, the metric is Minkowski every- 
where on the geodesic 7. 



1.11.5 First derivatives of the metric on 7 

To evaluate the Christoffel symbols in the Fermi normal coordinates, we recall from 
Eq. (1.11.5) that the curves x° = t, x a = fi°s are geodesies, so that these relations 
must be solutions to the geodesic equation, 

d 2 x a dx dxi _ 

This gives T a bc {x a )£l b £l c = 0. On 7, the Christoffel symbols are functions of t 
only, and are therefore independent of Since these coefficients are arbitrary, we 
conclude that r a 6c | 7 = 0. To obtain the remaining components, we recall that the 
basis vectors ejj are parallel transported along 7, so that 

since e] = u a . By virtue of Eqs. (1.11.8) and (1.11.9), we have that = 5 a M in the 
Fermi normal coordinates, and the parallel-transport equation implies r a ^ t | 7 = 0. 
The Christoffel symbols are therefore all zero on 7. We shall write this as 

<M, 7 | 7 = 0. (1.11.13) 

This proves that the Fermi normal coordinates enforce the local-flatness theorem 
everywhere on the timelike geodesic 7. 



1.11.6 Second derivatives of the metric on 7 

We next turn to the second derivatives of the metric, or the first derivatives of the 
connection. From the fact that is zero everywhere on 7, we obtain immediately 

rVtl =0. (i.n.14) 



From the definition of the Riemann tensor, Eq. (1.9.2), we also get 

r%, 7 | 7 = i?Vl 7 - (1.11.15) 

The other components are harder to come by. For these we must involve the devi- 
ation vectors £™ introduced in Eqs. (1.11.10) and (1.11.11). These vectors satisfy 
the geodesic deviation equation, Eq. (1.10.4), which we write in full as 
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According to Eqs. (1.11.5), (1.11.6), (1.11.10), and (1.11.11), we have that v a = 
£l a S a a , = 6 a t , and = s6 a a in the Fermi normal coordinates. If we substitute 
£ a = £t m the geodesic deviation equation and evaluate it at s = 0, we find 
r a &t c It = ^™6ctl7' which is just a special case of Eq. (1.11.15). 

To learn something new, let us substitute £ a = instead. In this case we find 

9r a o 6 -i-<i/'ff a -4- r a — r a r M -i-r a r M ^o 6 o d — n 

Before evaluating this on 7 (which would give = 0), we expand the first term in 
a power series in s: 

r a o6 = r« a6 | 7 + *r* o6 J 7 „" + o(* 2 ) = *r« aM | 7 " d + o(* 2 )- 

Dividing through by s and then evaluating on 7, we arrive at 

(i?« 6ad + 3r« M )| 7 « 6 « d = o. 

Because the coefficients fi° are arbitrary, we conclude that the quantity within the 
brackets, properly symmetrized in the indices b and d, must vanish. A little algebra 
finally reveals 

rVc| 7 = -^(^6c+^6ac)| 7 - (l-H-16) 

Equations (1.11.14), (1.11.15), and (1.11.16) give the complete set of derivatives of 
the Christoffel symbols on 7. 

It is now a simple matter to turn these equations into statements regarding the 
second derivatives of the metric at 7. Because the metric is Minkowski everywhere 
on the geodesic, only the spatial derivatives are nonzero. These are given by 

9tt, ab = — 2-Riot6| 7 , 

2 , 

9ta,bc = — g [Rtbac + Rtcab) (1.11.17) 
9ab,cd = --^(Racbd + Radbc)]^- 

From Eqs. (1.11.12), (1.11.13), and (1.11.17) we recover Eqs. (1.11.1), the expansion 
of the metric about 7, to second order in the spatial displacements x a . 

1.11.1 Riemann tensor in Fermi normal coordinates 

To express a given metric as an expansion in Fermi normal coordinates, it is nec- 
essary to evaluate the Riemann tensor on the reference geodesic, and write it as a 
function of t in this coordinate system. This is not as hard as it may seem. Because 
the Riemann tensor is evaluated on 7, we need to know the coordinate transforma- 
tion only at 7; as was noted above, this is given by dx a jdx^ = e% . We therefore 
have, for example, 

Rtabcit) = Rfi'a'p'-y' et e" ej . 

The difficult part of the calculation is therefore the determination of the orthonormal 
basis (which is parallel transported on the reference geodesic). Once this is known, 
the Fermi components of the Riemann tensor are obtained by projection, and these 
will naturally be expressed in terms of t. 



1.12 Bibliographical notes 
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1.12 Bibliographical notes 

Nothing in this text can be claimed to be entirely original, and the bibliographical 
notes at the end of each chapter intend to give credit where credit is due. During 
the preparation of this chapter I have relied on the following references: d'Inverno 
(1992); Manasse and Misner (1963); Misner, Thorne, and Wheeler (1973); Wald 
(1984); and Weinberg (1972). 
More specifically: 

Sections 1.2, 1.4, and 1.6 are based on Sees. 6.3, 6.2, and 6.11 of d'Inverno, 
respectively. Sections 1.7 and 1.8 are based on Sees. 4.7 and 4.4 of Weinberg, 
respectively. Section 1.10 is based on Sec. 3.3 of Wald. Finally, Sec. 1.11 and 
Problem 10 below are based on the paper by Manasse and Misner. 

1.13 Problems 

Warning: The results derived in Problem 9 are used in later portions of this book. 

1 . The surface of a two-dimensional cone is embedded in three-dimensional flat 
space. The cone has an opening angle of 2a. Points on the cone which all 
have the same distance r from the apex define a circle, and <j> is the angle that 
runs along the circle. 

a) Write down the metric of the cone, in terms of the coordinates r and <f>. 

b) Find the coordinate transformation x(r, <f>), y(r, <j>) that brings the metric 

into the form ds 2 = dx 2 + dy 2 . Do these coordinates cover the entire 
two-dimensional plane? 

c) Prove that any vector parallel transported along a circle of constant r on 

the surface of the cone ends up rotated by an angle j3 after a complete 
trip. Express j3 in terms of a. 

2. Show that if t a = dx a jdX obeys the geodesic equation in the form t a .^P = 
Kt a , then u a = dx a /d\* satisfies u a pU^ = if A* and A are related by 
dX* /d\ = exp j k(X) dX. 

3. a) Let x a (X) describe a timelike geodesic parameterized by a nonaffine param- 

eter A, and let t a = dx a /dX be the geodesic's tangent vector. Calculate 
how £ = —t a t a changes as a function of A. 

b) Let £ a be a Killing vector. Calculate how p = £ a t a changes as a function 

of A on that same geodesic. 

c) Let b a be such that in a spacetime with metric g a p, £b9ap = 2cg a p, where 

c is a constant. (Such a vector is called homothetic.) Let x a (r) describe 
a timelike geodesic parameterized by proper time r, and let u a = dx a /dr 
be the four-velocity. Calculate how q = b a u a changes with r. 

4. Prove that the Lie derivative of a type-(0,2) tensor is given by £ u T a p = 
T a ^u" +u". a T„ j 8+u". jf ,T a „. 

5. Prove that £^ and as given in Sec. 1.5, are indeed Killing vectors of 
spherically symmetric spacetimes. 

6. A particle with electric charge e moves in a spacetime with metric g a p in the 
presence of a vector potential A a . The equations of motion are u a; puP = 
eF a puP, where u a is the four-velocity and F a p = Ap ;a — A a; p. It is assumed 
that the spacetime possesses a Killing vector £ a , so that £^g a p = £$A a = 0. 
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Prove that 

(u a + eA a )Z a 
is constant on the world line of the charged particle. 

7. In flat spacetime, all Cartesian components of the Levi-Civita tensor can be 
obtained from £txyz = 1 by permutation of the indices. Using its tensorial 
property under coordinate transformations, calculate £ a P"/S in the following 
coordinate systems: 

a) Spherical coordinates (t,r,6,<f>). 

b) Spherical-null coordinates (u, v, 6, <j>), where u = t — r and v = t + r. 

Show that your results are compatible with the general relation £ a /3 7< 5 = 
\/^g[a fl-y 6] if [tr 6 </>] = 1 in spherical coordinates, while [uv6 4>] = 1 in 
spherical-null coordinates. 

8. In a manifold of dimension n, the Weyl curvature tensor is defined by 

2 / \ 2 

Ca/3-jS = Ra/3-yS ~ (5a[ 7 #<5]/3 ~ 9fi[-yRs]a ) + ( n - l)(n - 2) R9a h 9d ^ m 

Show that it possesses the same symmetries as the Riemann tensor. Also, 
prove that any contracted form of the Weyl tensor vanishes identically. This 
shows that the Riemann tensor can be decomposed into a tracefree part given 
by the Weyl tensor, and a trace part given by the Ricci tensor. The Einstein 
field equations imply that the trace part of the Riemann tensor is algebraically 
related to the distribution of matter in spacetime; the tracefree part, on the 
other hand, is algebraically independent of the matter. Thus, it can be said 
that the Weyl tensor represents the true gravitational degrees of freedom of 
the Riemann tensor. 

9. Prove that the relations 

are satisfied by any Killing vector £ a . Here, □ = V a V a is the curved- 
spacetime d'Alembertian operator. [Hint: Use the cyclic identity for the 
Riemann tensor, i2^ aj g 7 + i? M7a /3 + R^p-ya = 0.] 

10. Express the Schwarzschild metric as an expansion in Fermi normal coordinates 
about a radially infalling, timelike geodesic. 

11. Construct a coordinate system in a neighbourhood of a point P in spacetime, 
such that g a p\p = Vap, 9 a p,n\p = 0, and 

Such coordinates are called Riemann normal coordinates. 

12. A particle moving on a circular orbit in a stationary, axially symmetric space- 
time is subjected to a dissipative force which drives it to another, slightly 
smaller, circular orbit. During the transition, the particle loses an amount 6E 
of orbital energy (per unit rest- mass), and an amount SL of orbital angular 
momentum (per unit rest- mass) . You are asked to prove that these quantities 
are related by SE = QSL, where fi is the particle's original angular velocity. 

By "circular orbit" we mean that the particle has a four-velocity given by 



« a = 7($) + n^)), 
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where £^ and £^ are the spacetime's timelike and rotational Killing vectors, 
respectively; fi and 7 are constants. 

You may proceed along the following lines: First, express 7 in terms of E 
and L. Second, find an expression for 6u a , the change in four-velocity as the 
particle goes from its original orbit to its final orbit. Third, prove the relation 

u a 6u a =^{5E-Vt6i), 

from which the theorem follows. 
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Chapter 2 
Geodesic congruences 



Our purpose in this chapter is to develop the mathematical techniques required in 
the description of congruences, the term designating an entire system of noninter- 
secting geodesies. We will consider separately the cases of timelike geodesies and 
null geodesies. (The case of spacelike geodesies does not require a separate treat- 
ment, as it is virtually identical to the timelike case; it is also less interesting from 
a physical point of view.) We will introduce the expansion scalar, as well as the 
shear and rotation tensors, as a means of describing the congruence's behaviour. 
We will derive a useful evolution equation for the expansion, known as Raychaud- 
huri's equation. On the basis of this equation, we will show that gravity tends to 
focus geodesies, in the sense that an initially diverging congruence (geodesies flying 
apart) will be found to diverge less rapidly in the future, and that an initially con- 
verging congruence (geodesies coming together) will converge more rapidly in the 
future. And we will present Frobenius' theorem, which states that a congruence is 
hypersurface orthogonal — the geodesies are everywhere orthogonal to a family of 
hypersurfaces — if and only if its rotation tensor vanishes. 

The chapter begins (Sec. 2.1) with a review of the standard energy conditions 
of general relativity, since some of these are required in the proof of the focusing 
theorem. It continues (Sec. 2.2) with a simple introduction to the expansion scalar, 
shear tensor, and rotation tensor, based on the kinematics of a deformable medium. 
Congruences of timelike geodesies are then presented in Sec. 2.3, and the case of 
null geodesies is treated in Sec. 2.4. 

The techniques presented in this chapter are used in many different areas of 
gravitational physics. Most notably, they are used in the mathematical descrip- 
tion of event horizons, a topic covered in Chapter 5. They also play a key role 
in the formulation of the singularity theorems of general relativity, a topic that 
(unfortunately) is not covered in this book. 

2.1 Energy conditions 

2.1.1 Introduction and summary 

In the context of classical general relativity, it is reasonable to expect that the 
stress-energy tensor will satisfy certain conditions, such as positivity of the energy 
density and dominance of the energy density over the pressure. Such requirements 
are embodied in the energy conditions, which are summarized in Table 2.1. 

To put the energy conditions in concrete form it is useful to assume that the 
stress-energy tensor admits the decomposition 

T af > = pe%el + Pl eTe{ +p 2 e£ef +»,efef , (2.1.1) 
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Table 2.1: Energy conditions. 



Name Statement Conditions 

Weak T a pv a vP > p>0, p+pt>0 

Null T a pk a kP > P+Pi>0 

Strong (T aP - \Tg a p)v a v^ > p + £ 4 Pi > 0, p + ft > 

Dominant —T a pV® future directed p > 0, p > |ft| 

in which the vectors eJJ form an orthonormal basis; they satisfy the relations 

9ape«eP = ri^, (2.1.2) 

where = diag(— 1, 1, 1, 1) is the Minkowski metric. (It goes without saying 
that the basis vectors are functions of the coordinates.) Equations (2.1.1) and 
(2.1.2) imply that the quantities p (energy density) and Pi (principal pressures) are 
eigenvalues of the stress-energy tensor, and ejj are the normalized eigenvectors. 

The inverse metric can neatly be expressed in terms of the basis vectors. It is 
easy to check that the relation 

g<*P = rre%eP v , (2.1.3) 

where = diag(— 1, 1, 1, 1) is the inverse of ?j M „, is compatible with Eq. (2.1.2). 
Equations such as (2.1.3) are called completeness relations. 

If the stress-energy tensor is that of a perfect fluid, then p± = p2 = pz = p. 
Substituting this into Eq. (2.1.1) and using Eq. (2.1.3) yields 

T a ? = pe%e%+p{e?ee +e£ef + e 3 a ef) 

The vector is identified with the four- velocity of the perfect fluid. 

Some of the energy conditions are formulated in terms of a normalized, future- 
directed, but otherwise arbitrary timelike vector v"; this represents the four- velocity 
of an arbitrary observer in spacetime. In terms of the orthonormal basis, such a 
vector can be expressed as 

v a = 7 (e£ + ae? + 6e 2 , + ce£), 7 = (1 - a 2 - b 2 - c 2 )" 1/2 , (2.1.4) 

where a, 6, and c are arbitrary functions of the coordinates, such that a 2 +b 2 +c 2 < 1. 
We will also need an arbitrary, future-directed null vector k a . This we shall express 
as 

k a = e%+a'e ( {+b l e%+c l e%, (2.1.5) 

where a', b', and c' are arbitrary functions of the coordinates, such that a' 2 + b' 2 + 
c' 2 = 1. Recall that the normalization of a null vector is always arbitrary. 



2.1.2 Weak energy condition 



The weak energy condition states that the energy density of any matter distribu- 
tion, as measured by any observer in spacetime, must be nonnegative. Because an 
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observer with four- velocity v a measures the energy density to be T a pv a v 13 , we must 
have 

T aP v a v p > (2.1.6) 

for any future-directed timelike vector v a . To put this in concrete form we substitute 
Eqs. (2.1.1) and (2.1.4), which gives 

p + a 2 pi + b 2 p 2 + c 2 p 3 > 0. 

Because a, b, c, are arbitrary, we may choose a = b = c = 0, and this gives p > 0. 
Alternatively, we may choose b = c = 0, which gives p + a 2 p\ > 0. Recalling that 
a 2 must be smaller than unity, we obtain < p + a 2 pi < p+pi- So p + pi > 0, and 
similar expressions hold for pi and p 3 ■ The weak energy condition therefore implies 

p>0, P+Pi>0. (2.1.7) 

2.1.3 Null energy condition 

The null energy condition makes the same statement as the weak form, except that 
v a is replaced by an arbitrary, future-directed null vector k a . Thus, 

T a/3 k a k p > (2.1.8) 

is the statement of the null energy condition. Substituting Eqs. (2.1.1) and (2.1.5) 

gives 

p + a' 2 pi + b' 2 p2 + c' 2 p 3 > 0. 

Choosing b' = c' = enforces a' = 1, and we obtain p + pi > 0, with similar 
expressions holding for p 2 and p 3 . The null energy condition therefore implies 

P+Pi>0. (2.1.9) 

Notice that the weak energy condition implies the null form. 

2.1.4 Strong energy condition 
The statement of the strong energy condition is 

(T a f } -\Tg a f } )v a v*>0, (2.1.10) 

or T a pv a vP > — |T, where v a is any future-directed, normalized, timelike vector. 
Because T a p — \Tg a p = R a p/&-K by virtue of the Einstein field equations, the 
strong energy condition is really a statement about the Ricci tensor. Substituting 
Eqs. (2.1.1) and (2.1.4) gives 

1 2 (P + a 2 Pi + b 2 P2 + c 2 p 3 ) >^(p-pi~P2-P3)- 

Choosing a = b = c = enforces 7=1, and we obtain p + pi + P2 + p 3 > 0. 
Alternatively, choosing b = c = implies -y 2 = 1/(1 — a 2 ), and after some simple 
algebra we obtain p+ pi +P2 +p 3 > a 2 (P2 + p 3 — p — Pi). Because this must hold 
for any a 2 < 1, we have p + Pi > 0, with similar relations holding for p 2 and p 3 . 
The strong energy condition therefore implies 

P+Pi +P2 +P3 > 0, p+Pi>0. (2.1.11) 

It should be noted that the strong energy condition does not imply the weak form. 



26 



Geodesic congruences 



2.1.5 Dominant energy condition 

The dominant energy condition embodies the notion that matter should flow along 
timelike or null world lines. Its precise statement is that if v a is an arbitrary, 
future-directed, timelike vector field, then 

—T a pvP is a future-directed, timelike or null, vector field. (2.1.12) 

The quantity —T^v 13 is the matter's momentum density as measured by an observer 
with a four-velocity v a , and this is required to be timelike or null. Substituting 
Eqs. (2.1.1) and (2.1.4) and demanding that —T^v 13 not be spacelike gives 

2 22 j.2 2 2 2 ^ n 

p - a pi - b p 2 - c p 3 > 0. 

Choosing a = b = c = gives p 2 > 0, and demanding that —T^v 13 be future 
directed selects the positive branch: p > 0. Alternatively, choosing b = c = gives 
p 2 > a 2 pi 2 . Because this must hold for any a 2 < 1, we have p > \pi\, having taken 
the future direction for —T a pV®. Similar relations hold for p 2 and p$. The dominant 
energy condition therefore implies 

P > 0, P > \Pi\- (2.1.13) 

2.1.6 Violations of the energy conditions 

While the energy conditions typically hold for classical matter, they can be violated 
by quantized matter fields. A well-known example is the Casimir vacuum energy 
between two conducting plates separated by a distance d: 

- -I^]L 
P ~ 720 d 4 ' 

Although quantum effects allow for a localized violation of the energy conditions, 
recent work suggests that there is a limit to the amount by which the energy con- 
ditions can be violated globally. In this context it is useful to formulate averaged 
versions of the energy conditions. For example, the averaged null energy condition 
states that the integral of T a pk a kP along a null geodesic 7 must be nonnegative: 

T a pk a k p dX > 0. 

Such averaged energy conditions play a central role in the theory of traversable 
wormholes. The averaged null energy condition is known to always hold in flat 
spacetime, for noninteracting scalar and electromagnetic fields in arbitrary quantum 
states; this is true in spite of the fact that T a pk a kP can be negative somewhere along 
the geodesic. Its status in curved spacetimes is not yet fully settled. A complete 
discussion, as of 1994, can be found in Matt Visser's book. 

2.2 Kinematics of a deformable medium 

2.2.1 Two-dimensional medium 

As a warmup for what is to follow, consider, in a purely Newtonian context, the 
internal motion of a two-dimensional deformable medium. (Picture this as a thin 
film of Jell-O; see Fig. 2.1.) How the medium actually moves depends on its internal 
dynamics, which will remain unspecified for the purpose of this discussion. From a 
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Figure 2.1: Two-dimensional deformable medium. 



purely kinematical point of view, however, we may always write that for a sufficiently 
small displacement £ a about a reference point 0, 

for some tensor B a b . The time dependence of this tensor is determined by the 
medium's dynamics. For short time intervals, 



where 



£°(fi)=£ (fo)+A£ (to), 
Ae = B a b (t )e(t )At + O{At 2 ), 



and At = t± — to- To describe the action of B a b we will consider the simple figure 
— a circle — described by £°(to) = ro(cos cf>, sin^>). 

2.2.2 Expansion 
Suppose first that B a b is proportional to the identity matrix, so that 

B a_(¥ 

V \Q 

where 6 = B a a . Then = ^6roAt(cos<j), sin^), which corresponds to a change 
in the circle's radius: n = ro + \6rtyAt. The corresponding change in area is then 
AA = Ai - A = irr 2 9At, so that 

A At 

The quantity 6 is therefore the fractional change of area per unit time; we shall call 
it the expansion parameter. This is actually a function, as 6 may depend on time 
and on the choice of reference point 0. 



2.2.3 Shear 
Suppose next that B a b is symmetric and tracefree, so that 



<7 X -CT+ 
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Then A£° = roAt(a + cos <f> + a x sin ij>, —a + sin <j> + a x cos </>). The parametric equa- 
tion describing the new figure is ri((p) = r (l + <7 + Aicos2^> + ct x Ai sin2^>). If 
o x = 0, this represents an ellipse with major axis oriented along the <j> = di- 
rection (Fig. 2.2). If, on the other hand, a + = 0, then the ellipse's major axis is 
oriented along <j> = n/4. The general situation is an ellipse oriented at an arbitrary 
angle. It is easy to check that the area of the figure is not affected by the transfor- 
mation. What we have, therefore, is a shearing of the figure, and a+ and a x are 
called the shear parameters. These may also vary over the medium. 



2.2.4 Rotation 
Finally, we suppose that B a b is antisymmetric, so that 



LJ 
-LJ 



Then A£° = r o wAt(sin0, — cos^), and the new displacement vector is = 
ro(cos^', sin (/>'), where </>' = <j> — uiAt. This clearly represents an overall rotation 
of the original figure, which also leaves its area unchanged; ui is called the rotation 
parameter. 

2.2.5 General case 

The most general matrix B a b has 2x2 = 4 components, and it may be expressed 
as 

The action of this most general tensor is a linear combination of expansion, shear, 
and rotation. The tensor can also be expressed as 

B a b = 2^ &ab + 0~ a b + W ft, 

where 6 = B a a (the expansion scalar) is the trace part of B a b, o"ot = -B(ot) — ¥^ ab 
(the shear tensor) is the symmetric-tracefree part of B a b, and uj a b = -B[o6] (the 
rotation tensor) is the antisymmetric part of B a b- 

2.2.6 Three-dimensional medium 
In three dimensions, the tensor B a b would be expressed as 

B a b = $ab + ^ ab + U a b, 
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where 8 = B a a is the expansion scalar, a ab = B^ ab ) — \65 ab the shear tensor, and 
uj ab = B[ ab ] the rotation tensor. In the three-dimensional case, the expansion is the 
fractional change of volume per unit time: 

i Ay 
v ~At' 

To see this, treat the three-dimensional relation 

e(ti) = (s a b +B a b At)e(t ) 

as a coordinate transformation from £°(io) to £°(ti). The Jacobian of this trans- 
formation is 

J = det[<5° 6 + B%At] 
= l+Tr[B a 6 At] 
= 1 + 6 At. 

This implies that volumes at to and t± are related by V± = (1 + 9At)Vo, so that 
Vo6 = (Vi — Vq)I At. This argument shows also that the volume is not affected by 
the shear and rotation tensors. 

2.3 Congruence of timelike geodesies 

Let (f be an open region in spacetime. A congruence in @ is a family of curves such 
that through each point in @ there passes one and only one curve from this family. 
(The curves do not intersect; picture this as a tight bundle of copper wires.) In this 
section we will be interested in congruences of timelike geodesies, which means that 
each curve in the family is a timelike geodesic; congruences of null geodesies will be 
considered in the following section. We wish to determine how such a congruence 
evolves with time. More precisely stated, we want to determine the behaviour of the 
deviation vector £ a between two neighbouring geodesies in the congruence (Fig. 2.3), 
as a function of proper time r along the reference geodesic. The geometric setup is 
the same as in Sec. 1.10, and the relations 

U a U a = -1, U a . p U p = 0, U a .^ = ^ p U P , U a £ a = 0, 

where u a = dx a /dr is tangent to the geodesies, will be assumed to hold. Notice in 
particular that is orthogonal to u a : the deviation vector points in the directions 
transverse to the flow of the congruence. 

2.3.1 Transverse metric 

Given the congruence and the associated timelike vector field u a , the spacetime 
metric g a p can be decomposed into a longitudinal part —u a up and a transverse 
part h a p given by 

= g a f) +u a up. (2.3.1) 

The transverse metric is purely "spatial", in the sense that it is orthogonal to 
u a : u a h a p = = hapu 13 . It is effectively three-dimensional: in a comoving 
Lorentz frame at some point P within the congruence, u a = (—1,0,0,0), g a p = 
diag(— 1, 1, 1, 1), and h a p = diag(0, 1, 1, 1). We may also note the relations h a a = 3 
and h^h^ = h a p . 
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Figure 2.3: Deviation vector between two neighbouring members of a congruence. 

2.3.2 Kinematics 
We now introduce the tensor field 

B a /3 = u a; p. (2.3.2) 

Like h a p, this tensor is purely transverse, as u a B a p = u a u a -p = \{u a u a )-p = 
and B a puP = u a -pu^ = 0. It determines the evolution of the deviation vector: from 
^"pU 13 = u a p£P we immediately obtain 

£<X = B a p e, (2-3.3) 

and we see that B a p measures the failure of to be parallel transported along the 
congruence. 

Equation (2.3.3) is directly analogous to the first equation of Sec. 2.2. We may 
decompose B a p into trace, symmetric-tracefree, and antisymmetric parts. This 
gives 

B a /3 = g# h a p + a a p + Lo a p, (2.3.4) 

where 8 = B a a = u a . a is the expansion scalar, u a p = B^^ — ^O h a p the shear tensor, 
and Lu aj 3 = B[ a p] the rotation tensor. These quantities come with the same inter- 
pretation as in Sec. 2.2. In particular, the congruence will be diverging (geodesies 
flying apart) if 6 > 0, and it will be converging (geodesies coming together) if 8 < 0. 

2. 3. 3 Frobenius ' theorem 

Some congruences have a vanishing rotation tensor, u) a p = 0. These are said to be 
hypersurface orthogonal, meaning that the congruence is everywhere orthogonal to 
a family of spacelike hypersurfaces foliating @ (Fig. 2.4). We now provide a proof 
of this statement. 

The congruence will be hypersurface orthogonal if u a is everywhere proportional 
to n a , the normal to the hypersurfaces. Supposing that these are described by 
equations of the form $(x a ) = c, where c is a constant specific to each hypersurface, 
then n a oc 3> jC( and 

Ua = 

for some proportionality factor /<. (We suppose that $ increases toward the future, 
and the positive quantity /* can be determined from the normalization condition 
u a u a = —1.) Differentiating this equation gives u a -p = —H^- a p — ^,«/*,/3- Consider 
now the completely antisymmetric tensor 
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Figure 2.4: Family of hypersurfaces orthogonal to a congruence of timelike geode- 
sies. 



Direct evaluation of the right-hand side, using §-p a = $ ;a/ g, returns zero. We 
therefore have 

hypersurface orthogonal => U[a;P u -/] = 0- (2.3.5) 

The converse of this statement, that U[ a .pu^] = implies the existence of a scalar 
field 3> such that u a oc 3> jC( , is also true (but harder to prove). 

Equation (2.3.5) is a useful result, because whether or not u a is hypersurface 
orthogonal can be decided on the basis of the vector field alone, without having 
to find $ explicitly. We note that the geodesic equation u a ^uP = was never 
used in the derivation of Eq. (2.3.5). We also never used the fact that u a was 
normalized. Equation (2.3.5) is therefore quite general: A congruence of curves 
(timelike, spacelike, or null) is hypersurface orthogonal if and only if U[ a .pU^] = 0, 
where u a is tangent to the curves. This statement is known as Frobenius' theorem. 

We now return to our geodesic congruence, and use Eqs. (2.3.2) and (2.3.4) to 
calculate 

3! U[ a;/ 3W 7 ] = 2(w [a;/3 ]W 7 + U[ T>a ]U/3 + tl[ft 7 ]tl a ) 
= 2 (B[ a/3 ]W 7 + B[ ja ]Up + Btf^Ua) 
= 2(w a/3 W 7 + U la U/3 + OJ/3-yUa). 

If we put the left-hand side to zero and multiply the right-hand side by u 7 , we obtain 
u a /3 = 0, because w 7Cl w 7 = = ujpjU 1 . (Recall the purely transverse property of 
B a p.) Therefore, 

hypersurface orthogonal io a p = 0. (2.3.6) 

This concludes the proof of our initial statement. 

Notice that Eq. (2.3.6) holds for timelike geodesies only, whereas Eq. (2.3.5) 
is general. In fact, Eq. (2.3.6) could have been derived much more directly, but 
in doing so we would have bypassed the more general formulation of Frobenius' 
theorem. The direct proof goes as follows. 

If u a is hypersurface orthogonal, then u a = — p$, a for some scalars n and $. It 
follows from w a p = W[ a;j g] and the symmetry of $- a p that 

W„/3 = -$[ >a £* >j3 ] = \u [a H tf3] . 

A* 

But we know that uj a p must be orthogonal to u a , and the relation id a puP = implies 
fi : a = —(fJ, : puP)u a - This, in turn, establishes that the rotation tensor vanishes 
identically: u) a p = 0. 
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We have learned that /* must be constant on each hypersurface, because it varies 
only in the direction orthogonal to the hypersurfaces. Thus, n can be expressed as 
a function of and defining a new scalar $ = j d$, we find that u a is not 
only proportional to a gradient, it is equal to one: u a = — ^ Notice that if u a 
can be expressed in this form, then it automatically satisfies the geodesic equation: 

Wa;X=*;a/3*' /3 = *;/3a*' /3 = |(*' /3 */ 3 );a = |(w%);a = 0. 

In summary: 

A vector field u a (timelike, spacelike, or null, and not necessarily geodesic) 
is hypersurface orthogonal if there exists a scalar field $ such that u a oc 
which implies U[ a .pu^] = 0. If the vector field is timelike and geodesic, then it is 
hypersurface orthogonal if there exists a scalar field ^ such that u a = — \P, a , which 
implies u a p = W[ a; /3] = 0. 

2.3.4 Raychaudhuri's equation 

We now want to derive an evolution equation for 6, the expansion scalar. We begin 
by developing an equation for B a p itself: 

= (Ua;n utl );f3 ~ u a;n u ^-p ~ Ravp^Vi} 1 

= —B afl B' J 'p — R aix p v u^u v . 
The equation for 6 is obtained by taking the trace: 

^ = -B^Bp a - R a pu a u p . 

It is then easy to check that B a/3 Bp a = \6 2 + a af3 a a p - co al3 uj a p. Making the 
substitution, we arrive at 

d 4- = - \0 2 - ° a ^ a p + " a ^ a p ~ RafiU a u p . (2.3.7) 
ar 6 

This is Raychaudhuri's equation for a congruence of timelike geodesies. We note that 
since the shear and rotation tensors are purely spatial, a a ^a a p > and uj al3 uj a p > 0, 
with the equality sign holding if and only if the tensor is identically zero. 

2.3.5 Focusing theorem 

The importance of Eq. (2.3.7) for general relativity is revealed by the following 
theorem: Let a congruence of timelike geodesies be hypersurface orthogonal, so that 
w «/3 = 0, and let the strong energy condition hold, so that (by virtue of the Einstein 
field equations) R a pu a u^ > 0. Then the Raychaudhuri equation implies 

f- = - l -6 2 - 0^0 a p - R a pu a u? < 0. 
dr 3 ap ap _ 

The expansion must therefore decrease during the congruence's evolution. Thus, an 
initially diverging (8 > 0) congruence will diverge less rapidly in the future, while 
an initially converging (6 < 0) congruence will converge more rapidly in the future. 
This is the statement of the focusing theorem. Its physical interpretation is that 
gravitation is an attractive force when the strong energy condition holds, and the 
geodesies get focused as a result of this attraction. 

It also follows from Raychaudhuri's equation that under the conditions of the 
focusing theorem, dO/dr < — 10 2 . This can be integrated at once, giving 
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where Oq =9(0). This shows that if the congruence is initially converging (6 < 0), 
then 9(t) — > — oo within a proper time r < 3/|#o|- The interpretation of this result 
is that the congruence will develop a caustic, a point at which some of the geodesies 
come together (Fig. 2.5). Obviously, a caustic is a singularity of the congruence, 
and equations such as (2.3.7) lose their meaning at such points. 



2. 3. 6 Example 

As an illustrative example, let us consider the congruence of comoving world lines 
in an expanding universe with metric 

ds 2 = -dt 2 + a 2 (t) (dx 2 + dy 2 +dz 2 ), 

where a(t) is the scale factor. The tangent vector field is u a = —d a t, and a quick 
calculation reveals that 

&aP — u a -p — - n a p, 
a 

where an overdot indicates differentiation with respect to t. This shows that the 
shear and rotation tensors are both zero for this congruence. The expansion, on the 
other hand, is given by 

a a 6 dt 

This illustrates rather well the general statement (made in Sec. 2.3.8 below) that 
the expansion is the fractional rate of change of the congruence's cross-sectional 
volume (which is here proportional to a 3 ). 



2. 3. 7 Another example 

As a second example, we consider a congruence of radial, marginally bound, timelike 
geodesies of the Schwarzschild spacetime. The metric is 

ds 2 = -fdt 2 + f- 1 dr 2 + r 2 dti 1 , 

where / = 1 - 2M/r and dti 1 = dd 2 + sin 2 6 d<j> 2 . For radial geodesies, u e =u^ = 0, 
and the geodesies are marginally bound if 1 = E = — u a £^ = —ut- This means 
that the conserved energy is precisely equal to the rest-mass energy, and this gives 
us the equation u l = 1/f. From the normalization condition g a j3U°'u' 3 = — 1 we 
also get u r = ±\j2M/r; the upper sign applies to outgoing geodesies, and the lower 
sign applies to ingoing geodesies. 

The four- velocity is therefore given by 



u a d a = J' 1 d t ± \JlMjr d r , u a dx a = -dt ± f' 1 ^2M/r dr. 
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It follows that u a is equal to a gradient: u a = — $ jCn where 
$ = t T 4M 



V^/2M 

This means that the congruence is everywhere orthogonal to the spacelike hyper- 
surfaces $ = constant. 

The expansion is calculated as 

9 = u« = -L= (J=qu a ) =i(r 2 i 



where a prime indicates differentiation with respect to r. Completing the calculation 
gives 

Not surprisingly, the congruence is diverging (6 > 0) if the geodesies are outgoing, 
and converging (6 < 0) if the geodesies are ingoing. The rate of change of the 
expansion is calculated as dO/dr = (dO j dr)(dr / dr) = 6'u r , and the result is 

d6_ _ 9M 
dr ~ ~2r3' 

As dictated by the focusing theorem, dO/dr is negative in both cases. 

2. 3. 8 Interpretation of 9 

We now prove that 8 is equal to the fractional rate of change of SV , the congruence's 
cross-sectional volume: 

Although this may already be obvious from Eqs. (2.3.3) and (2.3.4), it is still in- 
structive to go through a formal proof. The first step is to introduce the notions of 
cross section, and cross-sectional volume. 

Select a particular geodesic 7 from the congruence, and on this geodesic, pick 
a point P at which r = Tp. Construct, in a small neighbourhood around P, a 
small set 6T,(tp) of points P' such that (i) through each of these points there passes 
another geodesic from the congruence, and (ii) at each point P', r is also equal to 
Tp. This set forms a three-dimensional region, a small segment of the hypersurface 
t = Tp (Fig. 2.6). We assume that the parameterization has been adjusted so that 
7 intersects 6T,(tp) orthogonally. (There is no requirement that other geodesies 
do, as the congruence may not be hypersurface orthogonal.) We shall call 6T,(tp) 
the congruence's cross section around the geodesic 7, at proper time r = Tp. We 
want to calculate the volume of this hypersurface segment, and compare it with the 
volume of 6T,(tq), where Q is a neighbouring point on 7. 

We introduce coordinates on 6T,(tp) by assigning a label y a (a = 1,2,3) to 
each point P' in the set. Recalling that through each of these points there passes 
a geodesic from the congruence, we see that we may use y a to label the geodesies 
themselves. By demanding that each geodesic keep its label as it moves away from 
6T,(tp), we simultaneously obtain a coordinate system y a in 6T,(tq) or any other 
cross section. This construction therefore defines a coordinate system (r, y a ) in a 
neighbourhood of the geodesic 7, and there exists a transformation between this 
system and the one originally in use: x a = x a (T,y a ). Because y a is constant along 
the geodesies, we have 

u »=( ?E-L\ . (2.3.9) 
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Figure 2.6: Congruence's cross section about a reference geodesic. 



On the other hand, the vectors 




(2.3.10) 



are tangent to the cross sections. These relations imply £ u e" = 0, and we also have 
u a e" = holding on 7 (and only 7). 

We now introduce a three-tensor /i o6 , defined by 

h ab = g a pe a a el (2.3.11) 

(A three-tensor is a tensor with respect to coordinate transformations y a — > y a ' , but 
a scalar with respect to transformations x a — > x a .) This acts as a metric tensor on 
6T,(t): For displacements contained within the set (so that dr = 0), x a = x a (y a ), 
and 

ds 2 = g a /3 dx a dx 13 

- «(t>)(t>) 

= { 9af) e a a el)dy a dy b 
= h ab dy a dy b . 

Thus, hat, is the three-dimensional metric on the congruence's cross sections. Be- 
cause 7 is orthogonal to its cross sections (u a e" = 0), we have that /i f, = h a p e"e^ 
on 7, where h a p = g a p + u a up is the transverse metric. If we define h ab to be the 
inverse of /i f,, then it is easy to check that 

h aP = h ab e a a el (2.3.12) 

on 7. 

The three-dimensional volume element on the cross sections, or cross-sectional 
volume, is 5V = Vhd 3 y, where h = det[h a b]. Because the coordinates y a are 
comoving (since each geodesic moves with a constant value of its coordinates), d 3 y 
does not change as the cross section <5S(r) evolves from r = tp to r = tq. A change 
in 5V therefore comes entirely from a change in \fh: 
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We must now calculate the rate of change of the three-metric: 
^ = ( a ne <* e P\ u » 

= fcd(eS/)^+S^e;(e!/) 

= {B aP + B Pa )e a a el (2.3.13) 
Multiplying by h ab and evaluating on 7, so that Eq. (2.3.12) may be used, we obtain 
h * b dh^ = {Bap+Bpa){h « be ay b) 
= 2B aP h a P 

= 26. 



This proves that 
which is the same statement as in Eq. (2.3.8). 



»=±£Vk, (2.3.14) 



2.4 Congruence of null geodesies 

We now turn to the case of null geodesies. The geometric setup is the same as in 
the preceding section, except that the tangent vector field, denoted k a , is null. We 
assume that the geodesies are affinely parameterized by A, so that k a = dx a /d\. 
The deviation vector will again be denoted £ a , and we again take it to be orthogonal 
to, and Lie transported along, the geodesies. The following equations therefore hold: 

k a k a = 0, k a .^k^ = 0, k a .^^ = £ a .pk^] fc a £a ~ 0* 

As in the preceding section, we will be interested in the transverse properties of 
the congruence, which are described by the deviation vector £ a . We can, however, 
anticipate some difficulties, because here the condition k a £ a = fails to remove an 
eventual component of £ a in the direction of k a . One of our first tasks, therefore, 
will be to isolate the purely transverse part of the deviation vector. This we will do 
with the help of h a p, the transverse metric. 



2.4-1 Transverse metric 

To isolate the part of the metric that is transverse to k a is not entirely straightfor- 
ward when k a is null. The expression h' a/3 = g a p + k a kp does not work, because 
h' a pk^ = k a ^ 0. To see what must be done, let us go to a local Lorentz frame at 
some point P, and let us introduce the null coordinates u = t — x and v =t + x. The 
line element can then be expressed as ds 2 = —dudv + dy 2 + dz 2 . Supposing that 
k a is tangent to the curves u = constant, we see that the transverse line element is 
ds 2 = dy 2 + dz 2 : the transverse metric is two-dimensional. This clearly has to do 
with the fact that ds 2 = for displacements along the v direction. 

To isolate the transverse part of the metric we need to introduce another null 
vector field N a , such that N a k a ^ 0. Because the normalization of a null vector is 
arbitrary, we may always impose k a N a = — 1. If k a = —d a u in the local Lorentz 
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frame, then we might choose N a = — ^d a v. Now consider the object h a p = g a p + 
k a Np + N a kp. This is clearly orthogonal to both k a and N a : h^k 13 = h^N 13 = 0. 
Furthermore, h a p = diag(0,0, 1, 1) in the local Lorentz frame, and h a jj is properly 
transverse and two-dimensional. This, therefore, is the object we seek. 

Let us now summarize the preceding discussion. For a congruence of null 
geodesies, the transverse metric is obtained as follows: Given the null vector field 
k a , select an auxiliary null vector field N a and choose its normalization to be such 
that k a N a = — 1. Then the transverse metric is given by 

h a p = g a p + k a N p + N a k p . (2.4.1) 

It satisfies the relations 

hapk? = h aP N? = 0, h a a =2, h%h^ = h% (2.4.2) 

which confirm that h a p is purely transverse (orthogonal to both k a and N a ) and 
effectively two-dimensional. 

Evidently, the conditions N a N a = and k a N a = — 1 do not determine N a 
uniquely. This implies that the transverse metric is not unique. As we shall see, 
however, quantities such as the expansion of the congruence will turn out to be the 
same for all choices of auxiliary null vector. Further aspects of this non-uniqueness 
are explored in Sec. 2.6, Problem 6. 

2.4-2 Kinematics 
As before, we introduce the tensor field 

B aP = k a;P (2.4.3) 
as a measure of the failure of £ a to be parallel transported along the congruence: 

^=B^e- (2-4.4) 

As before, B a p is orthogonal to the tangent vector field: k a B a p = = B a pk^ '. 
However, B a p is not orthogonal to N a , and Eq. (2.4.4) has a non-transverse com- 
ponent that should be removed. 

We begin by isolating the purely transverse part of the deviation vector, which 
we denote £ a . Because h a p is itself purely transverse, it is easy to see that 

|a = h a^ = ^ + ( V)fc a (2 .4.5) 

is the desired object. Its covariant derivative in the direction of k a represents the 
relative velocity of two neighbouring geodesies. It is given by 

l^k^h^B^e + h^k^, 

where we have inserted Eq. (2.4.4) in the first term of the right-hand side. Calcu- 
lating the second term gives 

= + {N^k^k^ 

and we see that the vector ^.pk 13 has a component along k^ . Once again we remove 
this by projecting with h a ^. Using the last of Eqs. (2.4.2), we obtain 

= h^h^B^e 
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for the transverse components of the relative velocity. In the first line we have 
replaced with because B^ v k v = 0. In the third line we have inserted the 
relation = h"p£?; this holds because is already purely transverse. 
We have obtained 

(I^T = B^l* (2.4.6) 

where 

= hWpB^ (2.4.7) 

is the purely transverse part of = k^ v . This can be expressed in a more explicit 
form by using Eq. (2.4.1): 

B*p = (<?/ +k a N» +N a k»){g p v + kpN v + Npk v )B llv 
= (fl a " + fcaJV" + NakfXBtf + kpB^N") 
= B a p + k a N»B fl p + kpB afl N» + k a kpB flv N>*N''. (2.4.8) 

Equation (2.4.6) governs the purely transverse behaviour of the null congruence, 
and the vector B a p£? can be interpreted as the transverse relative velocity between 
two neighbouring geodesies. 

As before, the evolution tensor B a p will be decomposed into its irreducible parts: 

B a p = 2# + CT a /3 + w aj g, (2.4.9) 

where 6 = B a a is the expansion scalar, o~ a p = B{ a p) ~ \®hap the shear tensor, and 
ui a p = B[ a p] the rotation tensor. The expansion is given more explicitly by 

e = 9 af} B af} 

= 9 ap B aP , 

which follows from Eq. (2.4.8) and the fact that B a p is orthogonal to k a . From this 
we obtain 

6 = k a . a . (2.4.10) 

We see explicitly that 6 does not depend on the choice of auxiliary null vector N a : 
the expansion is unique. The geometric meaning of the expansion will be considered 
in detail below; we will show that 6 is the fractional rate of change (per unit affine- 
parameter distance) of the congruence's cross-sectional area. (Recall that here, the 
transverse space is two-dimensional.) 

2.4-3 Frobenius' theorem 

We now show that if the vector field k a is such that uj a p = 0, then the congruence 
is hypersurface orthogonal, in the sense that k a must be proportional to the normal 
$, a of a family of hypersurfaces described by $(x a ) = c. These hypersurfaces must 
clearly be null: g a/3 ^^ t p oc g a/3 k a kp = 0. Furthermore, because k a is at once 
parallel and orthogonal to $, a (fc a $ jC , = 0), the vector k a is also tangent to the 
hypersurfaces. The null geodesies therefore lie within the hypersurfaces (Fig. 2.7); 
they are called the null generators of the hypersurfaces $(x a ) = c. 

We begin with the general statement of Frobenius' theorem derived in Sec. 2.3.3: 
the congruence is hypersurface orthogonal if and only if fc[ a;i afc 7 ] = 0. This condition 
implies B[ a p]k~, + B^^kp + B[p^k a = 0, and transvecting with TV 7 gives 

B [a p] = B [ia] kpW + B m h a Ni 

= \ {B ia kp - B ai kp + Bp^ka - B 1 pk a )N 1 
= B l[a kp } Ni + k [a Bp h Ni . 
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Figure 2.7: Family of hypersurfaces orthogonal to a congruence of null geodesies. 



But from Eq. (2.4.8) we also have 

B [a/3] = B [al3] ~ B tl [ a kfi]N fl - k^Bp^N* 1 , 

and it follows immediately that B[ a p] = 0. We therefore have 

hypersurface orthogonal ui a p = 0, (2.4.11) 

and this concludes the proof. (In Sec. 2.6, Problem 6 we show that if ui a p = for 
a specific choice of auxiliary null vector N a , then cj a p = for all possible choices.) 

The congruence is hypersurface orthogonal if there exists a scalar field 
which is constant on the hypersurfaces and k a = — fi$ : a for some scalar fx. In this 
form, k a automatically satisfies the geodesic equation: 

= -{n^)k a , 

where we have used $ ;a/ 3$ ,/3 = $ ;/ 3 a $ ,/3 = |($ j/ 3$ ,/3 ) jCe = 0. This is the general 
form of the geodesic equation, corresponding to a parameterization that is not affine. 
Affine parameterization is recovered if = 0, that is, if /* does not vary along 

the geodesies. 

2.4-4 Raychaudhuri's equation 

The derivation of the null version of Raychaudhuri's equation proceeds much as in 
Sec. 2.3.4. In particular, the equation 

^ = -B^Bp a - R aP k a k p 

follows from the same series of steps. It is then easy to check that B a ^Bp a = 
B a PB Pa = \d 2 + a af) a a p - co aP co a p, which gives 

|[ = Af - o a ^o a p +to a ?to a p - R a pk a k^. (2.4.12) 

This is Raychaudhuri's equation for a congruence of null geodesies. It should be 
noted that this equation is invariant under a change of auxiliary null vector N a ; 
this is established in Sec. 2.6, Problem 6. We also note that because the shear 
and rotation tensors are purely transverse, a a ^a a p > and LU a ^uj a p > 0, with the 
equality sign holding if and only if the tensor vanishes. 
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2.4-5 Focusing theorem 

The null version of the focusing theorem goes as follows: Let a congruence of null 
geodesies be hypersurface orthogonal, so that Lo a p = 0, and let the null energy 
condition hold, so that (by virtue of the Einstein field equations) Rapk 01 ^ > 0. 
Then the Raychaudhuri equation implies 

|J = Af - ° af> ° a p - R aP k a kP < 0, 

which means that the geodesies are focused during the evolution of the congruence. 
Integrating d9/dX < —\6 2 yields 

e- 1 (\)>e - 1 + ^, 

where 0q = 6(0). This shows that if the congruence is initially converging (0 O < 0), 
then 9(X) — > — oo within an affine parameter A < 2/|0 o |- As in the case of a timelike 
congruence, this generally signals the occurrence of a caustic. 

2-4-6 Example 

As an illustrative example, let us consider the congruence formed by the generators 
of a null cone in flat spacetime. The geodesies emanate from a single point P (which 
we place at the origin of the coordinate system) and they radiate in all directions. 
(Note that P is a caustic of the congruence.) In spherical coordinates, the geodesies 
are described by the relations t = A, r = A, 6 = constant, and cf> = constant, in 
which A is the affine parameter. The tangent vector field is 

k a = -d a (t - r). 

We must find an auxiliary null vector field N a that satisfies k a N a = — 1. If 
we assume that N a lies entirely within the (t, r) plane, the unique solution is 
N a = —\d a (t + r). With this choice we find that the transverse metric is given 
by h aj 3 = diag(0, 0, r 2 , r 2 sin 2 9). A straightforward calculation gives B a p = k a .p = 
diag(0, 0, r, r sin 2 6), and we see that B a p is already transverse for this choice of N a . 
We have found 

B a p = - h a p, 
r 

and this shows that the shear and rotation tensors are both zero for this congruence. 
The expansion, on the other hand, is given by 

e= 2 -= i A (47rr 2). 

r 4nr 2 dX 

This verifies the general statement (made in Sec. 2.4.8 below) that the expansion is 
the fractional rate of change of the congruence's cross-sectional area. 

We might ask how making a different choice for N a would affect our results. It 
is easy to check that the vector N a dx a = —dt + r sin 6 dcf> satisfies both N a N a = 
and N a k a = — 1. It is therefore an acceptable choice of auxiliary null vector field. 
This choice leads to a complicated expression for the transverse metric, which now 
has components along t and r. And while the expression for B a p does not change, 
we find that B a p is no longer equal to B aj 3, and is much more complicated that 
the expression given previously. You may check, however, that the relation B a p = 
h a p/r is not affected by the change of auxiliary null vector. Our results for 6, a a p, 
and uj a p are therefore preserved. 
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2.4-7 Another example 

As a second example, we consider the radial null geodesies of Schwarzschild space- 
time. For d6 = dej) = 0, the Schwarzschild line element reduces to 

ds 2 = -fdt 2 + f- 1 dr 2 = -f(dt - f- 1 dr){dt + f' 1 dr), 

where / = 1 — 2M/r. The displacements will be null if ds 2 = 0. If we define 

u = t — r* , v =t + r*, 

where r* = J / _1 dr = r + 2Mln(r/2M — 1), we find that u = constant on outgoing 
null geodesies, while v = constant on ingoing null geodesies. The vector fields 

C* = ~d a u, k™ = -d a v 

are null, and they both satisfy the geodesic equation, with +r as an affine parameter 
for fc™ ut , and — r as an affine parameter for fcj". As their labels indicate, fc° ut is 
tangent to the outgoing geodesies, while k™ is tangent to the ingoing geodesies. 
The congruences are clearly hypersurface orthogonal. Their expansions are easily 
calculated: 

»-±i, 

r 

where the positive (negative) sign refers to the outgoing (ingoing) congruence. We 
also have 

c$ _ __2_ 

dX~ r 2 ' 

which is properly negative. 



2-4-8 Interpretation of 9 

We shall now give a formal proof of the statement that 6 is the fractional rate of 
change of the congruence's cross-sectional area: 

s 'MTx SA - < 2 - 4 - 13 ' 

where 6 A is measured in the purely transverse directions. The proof is very sim- 
ilar to what was presented in Sec. 2.3.8; the only crucial difference concerns the 
dimensionality of the transverse space. 

We pick a particular geodesic 7 from the congruence, and on this geodesic we 
select a point P at which A = Xp. We then consider the null curves to which N a 
is tangent, and we let yu be the parameter on these auxiliary curves; we adjust the 
parameterization so that n is constant on the null geodesies. The auxiliary curve 
that passes through P is called j3, and we have that at P, /* = /* 7 . The cross section 
6S(Xp) is defined to be a small set of points P' in a neighbourhood of P such that (i) 
through each of these points there passes another geodesic from the congruence and 
another auxiliary curve, and (ii) at each point P', X is also equal to Xp and n is equal 
to n 1 . This set forms a two-dimensional region, the intersection of small segments 
of the hypersurfaces X = Xp and fi = n 1 . We assume that the parameterization 
has been adjusted so that both 7 and /? intersect SS(Xp) orthogonally. (There is 
no requirement that other curves do.) 

We introduce coordinates in SS(Xp) by assigning a label 6 A (A = 1, 2) to each 
point in the set. Recalling that through each of these points there passes a geodesic 
from the congruence, we see that we may use 6 A to label the geodesies themselves. 
By demanding that each geodesic keep its label as it moves away from SS(Xp), we 
simultaneously obtain a coordinate system 9 A in any other cross section ^(A). This 
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construction therefore produces a coordinate system (X,fi, A ) in a neighbourhood 
of the geodesic 7, and there exists a transformation between this system and the 
one originally in use: x a = x a (X, /*, A ). Because /* and 6 A are constant along the 
geodesies, we have 

A " 1 ~dX 



On the other hand, the vectors 



A WL 

are tangent to the cross sections. These relations imply £k^A = 0; an d we have also 
that on 7 (and only 7), k a = N a = 0. 

The remaining steps are very similar to those carried out in Sec. 2.3.8, and it 
will suffice to present a brief outline. The two-tensor 

„_ ,, ~a J5 

OAB — 9a/3 <^A e B 

acts as a metric on 55(A). The cross-sectional area is therefore defined by S A = 
y/acPO, where a = det^^e]- The inverse a AB of the two-metric is such that on 7, 
h a P = a AB e^e^, where h a p = g a p + k a Np + N a kp is the transverse metric. The 
relation 

—jj- = (B a p + Bp a ) e A e p B 
follows, and taking its trace yields 

\fa dX 

This statement is equivalent to Eq. (2.4.13). 



2.5 Bibliographical notes 

During the preparation of this chapter I have relied on the following references: 
Carter (1979); Visser (1995); and Wald (1984). 
More specifically: 

Section 2.1 is based on Sec. 9.2 of Wald and Chapter 12 of Visser. Sections 2.3 
and 2.4 are based partially on Sec. 9.2 and Appendix B of Wald, as well as Sec. 6.2.1 
of Carter. 



2.6 Problems 

Warning: The results derived in Problem 8 are used in later portions of this book. 

1 . Consider a curved spacetime with metric 

ds 2 = -dt 2 +d£ 2 +r 2 (£)dn 2 , 

where the function r(t) is such that (i) it is minimum at £ = 0, with a value 
r , and (ii) it asymptotically becomes equal to \£\ as £ — > ±00. 

a) Argue that this spacetime contains a traversable wormhole between two 

asymptotically-flat regions, with a throat of radius ro- 

b) Find which energy conditions are violated at £ = 0. 
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2. We examine the congruence of comoving world lines of a Friedmann- Robert- 
son- Walker spacetime, whose metric is 

ds 2 = -dt 2 + a 2 (t) ( -^-r + r 2 dn 2 



l-kr 2 

where a(t) is the scale factor, and k a constant normalized to either ±1 or 
zero. The vector tangent to the congruence is u a = dx a /dt. 

a) Show that the congruence is geodesic. 

b) Calculate the expansion, shear, and rotation of this congruence. 

c) Use the Raychaudhuri equation to deduce 

- = - T (P + 3p). 

where p is the energy density of a perfect fluid with four- velocity u a , and 
p is the pressure. 

3. In this problem we consider the vector field 

in Schwarzschild spacetime; the vector is expressed in terms of the usual 
Schwarzschild coordinates, and M is the mass of the black hole. 

a) Show that the vector field is timelike and geodesic. Describe the geodesies 

to which u a is tangent. 

b) Calculate the expansion of the congruence. Explain why the expansion is 

positive in the northern hemisphere and negative in the southern hemi- 
sphere. Explain also why the expansion is singular at the north and 
south poles. 

c) Compute the rotation tensor for this congruence. Check that its square is 

given by 

p _ M n-6M/r ^ 2 

d) Calculate dO/dr and check that Raychaudhuri's equation is satisfied. 

4. Derive the following evolution equations for the shear and rotation tensors of 
a congruence of timelike geodesies: 

2 1 

eafcuW 1 = --9a a p-a afl a^ /3 -uj afl ui^ + -(a^ l 'a l _ ll ,-oj^ l 'oJ l _ ll ,)h a p 



-C a ^uV + £i£J, 



1 

r 

2 



3 

Here, C ati p v is the Weyl tensor (Sec. 1.13, Problem 8), and i?JJ = i?^ — 
\{h^ v 'R^ v )h a p is the "transverse-tracefree" part of the Ricci tensor; its trans- 
verse part is R^p = h^h^R^. 

5. In this problem we consider a spacetime with metric 

ds 2 = -dt 2 + r2+ r f + ™f 6 dr 2 + (r 2 + a 2 cos 2 9) d9 2 + (r 2 + a 2 ) sin 2 9 dcf> 2 , 
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where a is a constant, together with a congruence of null geodesies with tan- 
gent vector field 

k a d a = d t + d r + -5- — t 84. 

r z + a z 

a) Check that k a is null, that it satisfies the geodesic equation, and that r is 

an affine parameter. 

b) Find a suitable auxiliary null vector N a and calculate the congruence's 

expansion, shear, and rotation. In particular, verify the following results: 

2r af} 2a 2 cos 2 8 

H + gt cos^ 8 (r z + a z cos z 8) z 

These show that the congruence is diverging, shear-free, and that it is 
not hypersurface orthogonal. 

c) Show that the coordinate transformation 

x = \Jr z +0? sin 8 cos <j>, y = \Jr z + a? sin 8 sin <j>, z = r cos 8 

brings the metric to the standard Minkowski form for flat spacetime. 
Express k a in this coordinate system. 

6. The auxiliary null vector field N a introduced in Sec. 2.4 is not unique, and in 
this problem we examine various consequences of this fact. For the purpose of 
this discussion we introduce vectors (^4 = 1,2) pointing in the two direc- 
tions orthogonal to both k a and N a , and we choose them to be orthonormal, 
so that they satisfy g a p e^e^ = Sab ■ We also introduce the 2x2 matrix 

Bab = B aj 3 e^e^, 

the projection of the tensor B a p = k a -p in the transverse space spanned by 
the vectors e A . In the following we shall use 5ab and 5 AB to lower and raise 
uppercase latin indices; for example, B AB = 5 AM 5 BN Bmn- 

a) Derive the following relations: 



h a/3 = § AB -ag^ = qAB gag/3 



B ' 



V — oabB , g g e A e B , u> H — u> e A e B , 

where g ab = \{B AB + B BA -8 5 AB ) and co AB = \{B AB -B BA ). These 
confirm that the tensors h a p, B a p, G a p, and uj a p are all orthogonal to 
both k a and N a . We now must determine how a change of auxiliary null 
vector field affects these results. 

b) The vector N" must satisfy the relations N a N a = and k a N a = -1. 

Prove that the transformation 

N a _^ N *a = N a + ck a + c A e a A , 

where c = \c A c A , is the only one that preserves the defining relations 
for the auxiliary null vector. (The coefficients c A are arbitrary.) 

c) Calculate how h a ^ changes under this transformation. 

d) Calculate how B al3 changes. 

e) Show that 8 is invariant under the transformation. 
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f) Prove that a" 13 changes according to 

This shows that if a a/ 3 = for one choice of N a , then a a p = for any 
other choice. Prove that a a ^a a p is invariant under the transformation. 

g) Prove that to"' 3 changes according to 



laf) - {c A u A B ) k a e% - (c b cj b a ) e a A k? + u AB e a A e B . 



This shows that if oj a p = for one choice of N a , then co a/ 3 = for any 
other choice. Prove that LU a ^LU a p is invariant under the transformation. 

These results imply that the Raychaudhuri equation is invariant under a 
change of auxiliary null vector field. They also show that (jO a p = implies 
hypersurface orthogonality for any choice of N a . 

7. We now want to derive evolution equations for the shear and rotation tensors 
of a congruence of null geodesies. For this purpose it is useful to refer back to 
the basis k a , N a , e A , and the 2x2 matrix Bab = B a pi A e B , introduced in 
Problem 6. We shall also need 

Rab = R ap ,i3 v e^fc"e^fc I/ , T AB = e B »e A . v k v . 

Notice that Rab is a symmetric matrix, while is antisymmetric. Notice 
also that it is possible to set Tab = by choosing e A to be parallel transported 
along the congruence. 

a) First, derive the main evolution equation, 

dB-AB _ R R c p , r c R ipCR 

— 7T — — —OAQO b ~ U AB + 1 A a GB + 1 B a AG- 

d\ 

b) Second, decompose the various matrices into their irreducible parts, as 

Bab = -jO &ab + oab + ^>ab, Rab = &ab + Cab, 

where gab and Cab are both symmetric and tracefree, while loab is 
antisymmetric. Prove that ffl = R a pk a k^ and Cab = C ail $ v eAk^e B k v , 
where C ail p v is the Weyl tensor (Sec. 1.13, Problem 8). Then introduce 
the parameterization 

n _ ( v+ °x \ r _( C+ C x 
~{a x -* + )> Cab ~ \ C x -C + 

for the symmetric-tracefree matrices, and 



OJAB 



( -co o ) ' Tab = ( -r 



for the antisymmetric matrices. 

c) Third, and finally, derive the following explicit forms for the evolution 
equations, 

d£_ _1 



-6 2 -2(a+ 2 +a x 2 )+2co 2 - 



~d\ 

da x 
dX 

did 



-0a + -C + +2Ta x , 
-6a x -C x -2Ta+, 
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Check that the equation for 6 agrees with the form of Raychaudhuri's 
equation given in the text. Recall that we can always set T = by taking 
e'X to be parallel transported along the congruence; this eliminates the 
coupling between the shear parameters. 

8. Retrace the steps of Sec. 2.4, but without the assumption that the null 
geodesies are affinely parameterized. Show that: 

a) Equation (2.4.8) stays unchanged. 

b) The expansion is now given by 6 = k a . a — k, where k is defined by the 

relation k^^k* 3 = Kk a . 

c) Raychaudhuri's equation now takes the form 

^- = R e- l -6 2 - a a ^a aP + uj aP uj aP - R aP k a k p . 



Chapter 3 
Hypersurfaces 



This chapter covers three main topics that can all be grouped under the rubric 
of hypersurfaces, the term designating a three-dimensional submanifold in a four- 
dimensional spacetime. 

The first part of the chapter (Sees. 3.1 to 3.3) is concerned with the intrinsic 
geometry of a hypersurface, and it examines the following questions: Given that 
the spacetime is endowed with a metric tensor g a p, how does one define an induced, 
three-dimensional metric h a t, on a particular hypersurface? And once this three- 
metric has been introduced, how does one define a vectorial surface element that 
allows vector fields to be integrated over the hypersurface? While these questions 
admit straightforward answers when the hypersurface is either timelike or spacelike, 
we will see that the null case requires special care. 

The second part of the chapter (Sees. 3.4 to 3.6) is concerned with the extrinsic 
geometry of a hypersurface, or how the hypersurface is embedded in the enveloping 
spacetime manifold. We will see how the spacetime curvature tensor can be decom- 
posed into a purely intrinsic part — the curvature tensor of the hypersurface — 
and an extrinsic part that measures the bending of the hypersurface in spacetime; 
this bending is described by a three-dimensional tensor K a i known as the extrinsic 
curvature. We will see what constraints the Einstein field equations place on the 
induced metric and extrinsic curvature of a hypersurface. 

The third part of the chapter (Sees. 3.7 to 3.11) is concerned with possible 
discontinuities of the metric and its derivatives across a hypersurface. We will 
consider the following question: Suppose that a hypersurface partitions spacetime 
into two regions, and that we are given a distinct metric tensor in each region; does 
the union of the two metrics form a valid solution to the Einstein field equations? We 
will see that the conditions for an affirmative answer are that the induced metric and 
the extrinsic curvature must be the same on both sides of the hypersurface. Failing 
this, we will see that a discontinuity in the extrinsic curvature can be explained by 
the presence of a thin distribution of matter — a surface layer — at the hypersurface. 
(The induced metric can never be discontinuous: the hypersurface would not have a 
well-defined intrinsic geometry.) We will first develop the mathematical formalism 
of junction conditions and surface layers, and then consider some applications. 

3.1 Description of hypersurfaces 

3.1.1 Defining equations 

In a four-dimensional spacetime manifold, a hypersurface is a three-dimensional 
submanifold that can be either timelike, spacelike, or null. A particular hypersurface 
£ is selected either by putting a restriction on the coordinates, 

$(x a )= 0, (3.1.1) 
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Figure 3.1: A three-dimensional hypersurface in spacetime. 



or by giving parametric equations of the form 

x a = x a (y a ), (3.1.2) 

where y a (a = 1,2,3) are coordinates intrinsic to the hypersurface. For example, 
a two-sphere in a three-dimensional flat space is described either by $(x,y, z) = 
x 2 + y 2 + z 2 — R 2 = 0, where R is the sphere's radius, or by x = i?sin#cos0, 
y = R sin 6 sin <j>, and z = Rcosd, where 9 and <f> are the intrinsic coordinates. 
Notice that the relations x a (y a ) describe curves contained entirely in £ (Fig. 3.1). 

3.1.2 Normal vector 

The vector 3> jC( is normal to the hypersurface, because the value of $ changes 
only in the direction orthogonal to S. A unit normal n a can be introduced if the 
hypersurface is not null. This is defined by 



_ / -1 if 
nn a = £ =| +1 . f 



£ is spacelike , . 

£ is timelike ' ( } 



and we demand that n a point in the direction of increasing $: n a $ jCe > 0. It is 
easy to check that n a is given by 

(3.1.4) 



,1/2 



if the hypersurface is either spacelike or timelike. 

The unit normal if not defined when £ is null, because g**" $ :V is then equal 
to zero. In this case we let 

k a = -$,a (3-1.5) 

be the normal vector; the sign is chosen so that k a is future-directed when $ in- 
creases toward the future. Because k a is orthogonal to itself (k a k a = 0), this vector 
is also tangent to the null hypersurface S (Fig. 3.2). In fact, by computing k^k 13 
and showing that it is proportional to k a , we can prove that k a is tangent to null 
geodesies contained in S. We have k^pk 13 = $ ;a/ 3<l>' /3 = $ ;/ 3 Ce < I>' /3 = \{$£$'P\ a \ 
because is zero everywhere on S, its gradient must be directed along k a , 

and we have that ($ j/ 3$' /3 ) ;a = 2nk a for some scalar k. We have found that the 
normal vector satisfies 

uoc T./3 _ ua 

/V — Aj iv , 

the general form of the geodesic equation. The hypersurface is therefore generated 
by null geodesies, and k a = dx a /d\ is tangent to the generators. In general, the 
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Figure 3.2: A null hypersurface and its generators. 



parameter A is not afnne, but in special situations in which the relations $(x a ) = 
constant describe a whole family of null hypersurfaces (so that $ iJ g$' /3 is zero not 
only on £ but also in a neighbourhood around £), k = and A is an afnne parameter. 

When the hypersurface is null, it is advantageous to install on X a coordinate 
system that is well adapted to the behaviour of the generators. We therefore let the 
parameter A be one of the coordinates, and we introduce two additional coordinates 
6 A (A = 2, 3) to label the generators; these are constant on each generator, and 
they span the two-dimensional space transverse to the generators. Thus, we shall 
adopt 

y a = {\e A ) (3.1.6) 

when £ is null; varying A while keeping 6 A constant produces a displacement along 
a single generator, and changing 6 A produces a displacement across generators. 

3.1.3 Induced metric 

The metric intrinsic to the hypersurface £ is obtained by restricting the line element 
to displacements confined to the hypersurface. Recalling the parametric equations 
x a = x a (y a ), we have that the vectors 

° dy a 

are tangent to curves contained in £. (This implies that e"n a = in the non-null 
case, and e"k a = in the null case.) Now, for displacements within £, 

ds^ = g a /3 dx a dx 13 

(dx a \ (dx? , 

= h ab dy a dy b , (3.1.8) 



where 

Kb = 9 a pey b (3.1.9) 

is called the induced metric, or first fundamental form, of the hypersurface. It is 
a scalar with respect to transformations x a — > x a of the spacetime coordinates, 
but it transforms as a tensor under transformations y a — > y a of the hypersurface 
coordinates. We will refer to such objects as three-tensors. 

These relations simplify when the hypersurface is null and we use the coordinates 
of Eq. (3.1.6). Then ef = (dx a /d\) e A = k a , and it follows that /in = g a pk a k^ = 
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and h\A = 9 a pk a e A = 0, because by construction, = (dx a /d8 A )\ is orthogonal 
to k a . In the null case, therefore, 

dsl = gab d9 A d9 B , (3.1.10) 

where 

(Jab = g a pe A e p B , e a A = \^qaJ ■ (3.1.11) 

Here the induced metric is a two-tensor. 

We conclude by writing down completeness relations for the inverse metric. In 
the non-null case, 

g a(3 = £n a n P + h ab eyl, (3.1.12) 

where h ab is the inverse of the induced metric. Equation (3.1.12) is verified by 
computing all inner products between n a and e". In the null case we must introduce, 
everywhere on S, an auxiliary null vector field N a satisfying N a k a = — 1 and 
Na^A = ( see Sec. 2.4). Then the inverse metric can be expressed as 

g aP = _ k » N P _ N a k P + ^B e a e ^ (3.1.13) 

where a AB is the inverse of gab- Equation (3.1.13) is verified by computing all 
inner products between k a , N a , and e A . 



3.1.4 Light cone in flat spacetime 

An example of a null hypersurface in flat spacetime is the future light cone of an 
event P, which we place at the origin of a Cartesian coordinate system x a . The 
defining relation for this hypersurface is $ = t — r = 0, where r 2 = x 2 +y 2 +z 2 . The 
normal vector is k a = —d a (t — r) = (—l,x/r,y/r,z/r). A suitable set of parametric 
equations is t = A, x = A sin 8 cos <j>, y = A sin 6 sin <j>, and z = \cos6, in which 
y a = {\6,<j)) are the intrinsic coordinates; A is an affine parameter on the light 
cone's null generators, which move with constant values of 6 A = (6, <j>). 

From the parametric equations we compute the hypersurface's tangent vectors, 

dx a 

e a = = (l,sin#cos(/>, sin#shi(/>,cos0) = k a , 

dx a 

e 9 = -57- = (1j^cos0cos0, Acosflsin^, — Asinfl), 
Qx a 

e A = "ttt = (1; — A sin sin 0, A sin cos <f>, 0). 
v acp 

You may check that these vectors are all orthogonal to k a . Inner products between 
e a and define the two-metric oab, and we find 

(jab d9 A d6 B = \ 2 (d6 2 + sin 2 6 dcjj 2 ). 

Not surprisingly, the hypersurface has a spherical geometry, and A is the areal radius 
of the two-spheres. 

It is easy to check that the unique null vector N a that satisfies the relations 
N a k a = -1 and N a e a A = is N a = |(1, - sin6 cos <j>, - sin0 sin </>, - cos 6). You 
may also verify that the vectors k a , N a , and e\ combine as in Eq. (3.1.13) to form 
the inverse Minkowski metric. 
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3.2 Integration on hypersurfaces 

3.2.1 Surface element (non-null case) 
If £ is not null, then 

dS = |/i| 1/2 c/ 3 y, (3.2.1) 

where h = det[/i f,], is an invariant three-dimensional volume element on the hyper- 
surface. To avoid confusing this with the four-dimensional volume element \f zr g d 4 x, 
we shall refer to dT, as a surface element. The combination n a dT, is a directed sur- 
face element that points in the direction of increasing $. In the null case these 
quantities are not defined, because h = and n a does not exist. 

To see how Eq. (3.2.1) must be generalized to incorporate also the null case, we 
consider the infinitesimal vector field 

dE„ =e^e?elejd 3 y, (3.2.2) 

where £ Ma /3 7 = ^/—g[^ a f3 7] is the Levi-Civita tensor. We will show below that 

d£ a = sn a dT, (3.2.3) 

when the hypersurface is not null. Thus, apart from a factor e = ±1, dT, a is a 
directed surface element on S. Notice that when £ is spacelike, the factor e = — 1 
makes dT, a a past-directed vector; this is a potential source of confusion. Notice 
also that Eq. (3.2.2) remains meaningful even when the hypersurface is null. By 
continuity, therefore, dT, a is also a directed surface element on a null hypersurface. 

Because dT, a is proportional to the completely antisymmetric Levi-Civita ten- 
sor, its sign depends on the ordering of the coordinates y 1 , y 2 , and y 3 . But this 
ordering is a priori arbitrary, and we need a convention to remove the sign ambi- 
guity. We shall choose an ordering that makes the scalar / = £ lia M n>le i e 2 e J a 
positive quantity. Notice that this convention was already adopted when we went 
from Eq. (3.2.2) to Eq. (3.2.3): n a dT, a = d£ > 0. 

As a first example of how this works, consider a hypersurface of constant t in 
Minkowski spacetime. If $ = t, then n a = —d a t is the future-directed normal 
vector. If we choose the ordering y a = (x,y,z), we find that / = et xyz = 1 has the 
correct sign. Equation (3.2.2) implies d£ M = 6^ dxdydz = — n M dxdydz, which is 
compatible with Eq. (3.2.3). 

As a second example, we take a surface of constant x in Minkowski spacetime. 
We take $ = x, and n a = d a x points in the direction of increasing $. We choose 
the ordering y a = (y,t,z) because / = e xy t z = —£ x tyz = £txyz = 1 has then the 
correct sign. [Notice that the more tempting ordering y a = (t,y,z) would produce 
the wrong sign.] With this choice, Eq. (3.2.2) implies dS M = 5^ dtdydz = n M dtdydz, 
which is compatible with Eq. (3.2.3). 

We now turn to the derivation of Eq. (3.2.3). It is clear that dH^ must be 
proportional to n M , because e^£ Ma/ 3 7 e"ef ej = by virtue of the antisymmetric 
property of the Levi-Civita tensor. So we may write 

Zp.afi'y £\ ^2 ^3 — ^f^ni 

where / = £^ ta/ 3 7 n' i e"ef ej. Because / is a scalar, we may evaluate it in any con- 
venient coordinate system x a . We choose our coordinates so that x° = 3>, and on 
S we identify x a with the intrinsic coordinates y a . Then / = y/—gn^. In these 
coordinates, 5** = 5 a/3 $ ja $ ]( a, and n$ = e |<?**| _1//2 is the only nonvanishing com- 
ponent of the normal. It follows that n* = g® a n a = (?**n$ = |g**| 1//2 , and we 
have that / = | 1//2 - We now use the definition of the matrix inverse to write 
g<s><5> ± co f ac tor(5$$)/(?, where the cofactor of a matrix element is the determinant 
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obtained after eliminating the row and column to which the element belongs. This 
determinant is clearly h, and we conclude that 

/ = N 1/2 - 

While this result was obtained in the special coordinates x a , it is valid in all co- 
ordinate systems because h, like h a b, is a scalar with respect to four-dimensional 
coordinate transformations. This result shows that when £ is not null, Eq. (3.2.3) 
is indeed equivalent to Eq. (3.2.2). 

3.2.2 Surface element (null case) 

As we have seen in Sec. 3.1.2, when £ is null we identify y 1 with A, the parameter 
on the hypersurface's null generators, and the remaining coordinates, denoted 6 A , 
are constant on the generators. Then e" = k a , d 3 y = d\cP6, and we may write the 
directed surface element as 

d£ M = k v dS^d\, (3.2.4) 

where 

dS^ = e^fr e^ej d 2 9 (3.2.5) 

is interpreted as an element of two-dimensional surface area. We will show below 
that this can also be expressed as 

dS aP = 2k [a N /3] ^d 2 8, (3.2.6) 

where N a is the auxiliary null vector field introduced in Eq. (3.1.13), and a = 
det[<7Ae], with gab the two-metric defined by Eq. (3.1.11). Combining Eq. (3.2.6) 
with Eq. (3.2.4) yields 

dS a = -k a Vad 2 6d\. (3.2.7) 

The interpretation of this result is clear: Apart from a minus sign, the surface 
element is directed along k a , the normal to the null hypersurface; the factor dX 
represents an element of parameter-distance along the null generators, and s/acPO 
is an element of cross-sectional area — an element of two-dimensional surface area 
in the directions transverse to the generators. 

There is also an ordering issue with the coordinates 6 A , and our convention shall 
be that the scalar / = e^p^N^k^e^eJ must be a positive quantity. Notice that 
this convention was already adopted when we went from Eq. (3.2.5) to Eq. (3.2.6): 
N a k l3 dS at3 = ^/b r d 2 e>0. 

As an example, consider a surface u = constant in Minkowski spacetime, where 
u = t — x. The normal vector is k a = —d a (t — x), and we may choose the ordering 
6 A = (y, z )- Then N a = —\d a {t + x) satisfies all the requirements for an auxiliary 
null vector field. It is easy to check that with these choices, / = 1 (which is properly 
positive). We obtain dSt x = dydz = —dS x t, and since t can be identified with the 
affine parameter A, Eq. (3.2.4) implies dT, t = dtdydz = —dT, x . These results are 
compatible with Eq. (3.2.7). 

Let us consider a more complicated example: the light cone of Sec. 3.1.4. The 
vectors k a , N a , and e\ are displayed in that section, and the cone's intrinsic coor- 
dinates are y a = (A, 6, </>). We want to compute dH^ for this hypersurface, starting 
with the definition of Eq. (3.2.2). We know that dH^ must point in the direction 
of the normal, so that oE M = —fk^d9d(f>d\, where / = e^p^N^k^e^eJ. If we 
let = and k v = e", we can write this as / = [/i v ^ ^\e^e\e^el = detE, 
where E is the matrix constructed by lining up the four basis vectors. Its deter- 
minant is easy to compute, and we obtain / = A 2 sin# = ~Jo. We therefore have 
dX M = —k^sfa d 2 6d\ which is just the same statement as in Eq. (3.2.7). 
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We must now give a proper derivation of Eq. (3.2.6). The steps are somewhat 
similar to those leading to Eq. (3.2.3). We begin by noting that the tensor e M „/3 7 e^ej 
is orthogonal to e A and antisymmetric in the indices /* and v. It may be expressed 
as 

e 2 e 3 = 2/A;[jj./V„] = fik^Nv — N^ki,), 

where / = e^p^N^k^e^eJ > 0. To evaluate / we choose our coordinates x a such 
that x° = 3>, and x a = y a = (A, 9 A ) on E. In these coordinates, = 1 and k x = 1 
are the only nonvanishing components of the normal vector, TV* = 1 comes as a 
consequence of the normalization condition N a k a = — 1, and = follows from 
the fact that k a is null. Using this information, we deduce that / = y/^g, and we 
must now compute the metric determinant in the specified coordinates. For this 
purpose, we note that the completeness relations of Eq. (3.1.13) imply the following 
structure for the inverse metric: 




this immediately implies detg 1 = — detfu' 4 - 8 ], or y/^g = \fa. We therefore have 

which holds in any coordinate system x a . This shows that Eq. (3.2.6) is indeed 
equivalent to Eq. (3.2.5), and this implies that Eq. (3.2.7) is equivalent to Eq. (3.2.2) 
when E is null and coordinates y a = (X,6 A ) are placed on the hypersurface. 



3. 2. 3 Element of two-surface 

The interpretation of 

as a directed element of two-dimensional surface area is not limited to the con- 
sideration of null hypersurfaces. Here we consider a typical situation, in which 
a two-dimensional surface S is imagined to be embedded in a three-dimensional, 
spacelike hypersurface S. 

The hypersurface S is described by an equation of the form $(x a ) = 0, and 
by parametric relations x a (y a ); n a oc is the future-directed unit normal, and 
the vectors e" = dx a /dy a are tangent to the hypersurface. The metric on E, 
induced from g a p, is /i f, = g a p e"e%, and we have the completeness relations g alB = 
-nV+/i oi, e^. 

The two-surface S is introduced as a submanifold of E. It is described by an 
equation of the form ip(y a ) = 0, and by parametric relations y a (6 A ), in which 
6 A are coordinates intrinsic to S; r a oc d a ip is the outward unit normal, and the 
three- vectors e a A = dy a /d6 A are tangent to the two-surface. The metric on S, 
induced from hat, is gab = h a \ } e A e b B , and we have the completeness relations 
h ab = r a r b + a AB e A e b B . 

The parametric relations y a (6 A ) and x a (y a ) can be combined to give the rela- 
tions x a (6 A ), which describe how S is embedded in the four-dimensional spacetime. 
The vectors 

a _ dx a _ dx a dy a _ a a 

eA ~m A ~ drdd A ~ €a €a 

are tangent to S, and 

r a =r°e£, r a n a = 
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is normal to S. The vector n a is also normal to 5, and we have that the two- 
surface admits two normal vectors: a timelike normal n a and a spacelike normal 
r a . We note that the spacelike normal can be related to a gradient, r a oc d a ^, if 
we introduce, in a neighbourhood of S, a function ty(x a ) such that = tp. In 
this description, the induced metric on S is still 

o~ab = h ab e a A e b B 

= (9 a pe a y b )e a A e b B 

= 9«p{e a a e a A ){ele B ) 

= g a pe A ^ B , 

and the completeness relations 

g aP = _ n a n P + r a r P + ^AB £ a £ 

are easily established from our preceding results. 

We want to show that dS a p can be expressed neatly in terms of the timelike 
normal n a , the spacelike normal r a , and s/ad 2 6, the induced surface element on S. 
The expression is 

dS aP = -2n [a r p] ^/ad 2 e, (3.2.8) 

where a = det[<7Ae]- The derivation of this result involves familiar steps. We first 
note that because e^p^ e^ej is orthogonal to and antisymmetric in ji and v, it 
may be expressed as 

where / = e^p^n^r^e^ej > 0. To evaluate / we adopt coordinates x° = 
x 1 = ^, and on S we identify x A with 6 A . In these coordinates, n$ = — (— g**) -1 / 2 
is the only nonvanishing component of the timelike normal, = (<?' I " I, ) _1/ ' 2 is 
the only nonvanishing component of the spacelike normal, and from the fact that 
these vectors are orthogonal we infer = 0. From all this we find that f 2 = 
gg^^g", which we rewrite as f 2 = cofactor cofactor^ip)/;?. We also have 
cofactor(<7<j^) = 0, and these two equations give us enough information to deduce 

/ = 

This result is true in any coordinate system x a . 

As a final remark, we note that the vectors n a and r a can be combined to form 
null vectors k a and N a . The appropriate relations are 

k a = -i= (n a +r a ), N a = (n a -r a ), 

and these vectors are the null normals of the two-surface S. It is easy to check that 
with these substitutions, Eq. (3.2.8) takes the form of Eq. (3.2.6). 

3.3 Gauss-Stokes theorem 

3.3.1 First version 

We consider a finite region f of the spacetime manifold, bounded by a closed 
hypersurface d"V (Fig. 3.3). The signature of the hypersurface is not restricted; it 
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Figure 3.3: Proof of the Gauss-Stokes theorem. 



may have segments that are timelike, spacelike, or null. We will show that for any 
vector field A" defined within f, 

[ A a y /=gd l x= <[ A a dT, a , (3.3.1) 

where dT, a is the surface element defined by Eq. (3.2.2). 

To prove this result, known as Gauss' theorem, we construct the following co- 
ordinate system in f . We imagine a nest of closed hypersurfaces foliating Y , with 
the boundary d"V forming the outer layer of the nest. (Picture this as the many 
layers of an onion.) We let x° be a constant on each one of these hypersurfaces, with 
x° = 1 designating df, and x° = the zero- volume hypersurface at the "centre" of 
f . While x° grows "radially outward" from this "centre" , we take the remaining 
coordinates x a to be angular coordinates on the closed hypersurfaces x° = constant. 
The coordinates y a on df are then identified with these angular coordinates. 

Using such coordinates, the left-hand side of Eq. (3.3.1) becomes 

/ A a . a ^d 4 x = f (^A a ), a d 4 x 

= J dx° j>{^A°) fi d s x + J dx° fw=gA a ), a £ 

^A° d z x 



x 



j3 x 



= / ^A°d 3 y. 

Jay 

In the first line we have used the divergence formula for the vector field A". The 
second integral of the second line vanishes because x° are angular coordinates and 
the integration is over a closed three-dimensional surface. (Understanding this 
statement requires some thought. Try working through a three-dimensional version 
of the proof, using spherical coordinates in flat space.) In the fourth line, the 
contribution at x° = vanishes because the "hypersurface" x° = has zero volume. 
It is easy to check that in the specified coordinates, dT, a = S° a ^—gd 3 y, giving 



I A a dT, a = I A°^d 3 y 
Jar Jar 



iar Jar 

for the right-hand side of Eq. (3.3.1). The two sides are therefore equal in the 
specified coordinate system; because Eq. (3.3.1) is a tensorial equation, this suffices 
to establish the validity of the theorem. 
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S 2 



Si 



Figure 3.4: Two spacelike surfaces and their normal vectors. 

3.3.2 Conservation 

Gauss' theorem has many useful applications. An example is the following conser- 
vation statement. 

Suppose that a vector field j a has a vanishing divergence, 

j% = o. 

Then j> s j a dT a = for any closed hypersurface S. Supposing now that j a vanishes 
at spatial infinity, we can choose S to be composed of two spacelike hypersurfaces, 
Si and S 2 , extending all the way to infinity (Fig. 3.4), and of a three-cylinder at 
infinity, on which j a = 0. Then 

f j a dT a + f j a dT a =0. 

On each of the spacelike hypersurfaces, dT a = —n a \/hd 3 y, where n a is the outward 
normal to the closed surface S, and h is the determinant of the induced metric on 
the spacelike hypersurfaces. Letting n a = n<i a on £2 and n a = —n\ a on Si, where 
ni a and ni a are both future directed, we have 

j% = => j j a n la Vhd 3 y= f j a n 2a ^/hd 3 y. (3.3.2) 

The interpretation of this result is clear: If j a is a divergence-free vector, then the 
"total charge" J j a n a dT, is independent of the hypersurface on which it is evaluated. 
This is obviously a statement of "charge" conservation. 

3.3.3 Second version 

Another version of Gauss' theorem (usually called Stokes' theorem) involves a three- 
dimensional region S bounded by a closed two-surface dT,. It states that for any 
antisymmetric tensor field B al3 in S, 

f B ap . dZ a = ±<f B a ?dS a p, (3.3.3) 

where dS a p is the two-surface element defined by Eq. (3.2.5). 

The derivation of this identity proceeds along familiar lines. We construct a 
coordinate system such that (i) x° is constant on the hypersurface S, (ii) x 1 = 
constant describes a nest of closed two-surfaces in S (with x 1 = 1 representing dT, 
and x 1 = the zero-area surface at the "centre" of S), and (iii) x A are angular 
coordinates on the closed surfaces (with 9 A = x A on dT) . 
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It is easy to check that with such coordinates, dT, a = 5° a ^/ zr gdx 1 dx 2 dx i . The 
left-hand side of Eq. (3.3.3) becomes 

/ B af> .gdE a = [ -^{yf^B a ^)^d^ a 
± [ {^B^) s dx 1 dx 2 dx 3 

= J dx l j{^B 01 )^dx 2 dx 3 + J dx 1 j>{y/=g~B aA ) tA dx 2 dx 3 



= j> V^gB 01 dx 2 dx 3 



Jos 



p ul d 2 e 



In the first line we have used the divergence formula for an antisymmetric tensor 
field. The explicit expression for dT, a was substituted in the second line. The 
second integral of the third line vanishes because x A are angular coordinates and 
the domain of integration is a closed two-surface. In the fourth line, the lower limit 
of integration does not contribute because the "surface" x 1 = has zero area. 

It is easy to check that in the specified coordinate system, B"^ dS a p = (B 01 — 
B 10 )y/=gd 2 8 = 2B 01 y/=gd 2 6. The right-hand side of Eq. (3.3.3) therefore reads 



., B ap dS a0 = A B y/^gd 6. 

A JdY. J 9E 

Equation (3.3.3) follows from the equality of both sides in the specified coordinate 
system. 



3.4 Differentiation of tangent vector fields 

3.4-1 Tangent tensor fields 

(For the remainder of Chapter 3, except for Sec. 3.11, we shall assume that the 
hypersurface S is either spacelike or timelike. In Sec. 3.11 we shall return to the 
case of null hypersurfaces.) 

Once we are presented with a hypersurface S, it is a common situation to have 
tensor fields A al3 "' that are defined only on S and which are purely tangent to the 
hypersurface. Such tensors admit the following decomposition: 

A aP- = A ab - e * e l3 . . . j ( 3A1 ) 

where e" = dx a /dy a are basis vectors on S. Equation (3.4.1) implies that A a ^"'n a = 
A a/3 '"np = • • • = 0, which confirms that A al3 "' is tangent to the hypersurface. We 
note that an arbitrary tensor T a/3 "' can always be projected down to the hyper- 
surface, so that only its tangential components survive. The quantity that effects 
the projection is h af} = h ab e%e% = g af) - en a n^ and h^h^ ■ ■ ■ T^'" is evidently 
tangent to the hypersurface. 
The projections 

A af} ... e%4 ■■■ = A ab ... ee h am h bn ■ ■ ■ A mn - (3.4.2) 

give the three-tensor A ab "' associated with the tangent tensor field A al3 "'; latin 
indices are lowered and raised with h ab and h ab , respectively. Equations (3.4.1) 
and (3.4.2) show that one can easily go back and forth between a tangent tensor 
field A al3 "' and its equivalent three-tensor A ab "'. We emphasize that while A ab "' 
transforms as a tensor under a transformation y a — > y a of the coordinates intrinsic 
to S, it is a scalar under a transformation x a — > x a of the spacetime coordinates. 
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3.4-2 Intrinsic covariant derivative 

We wish to consider how tangent tensor fields are differentiated. We want to relate 
the covariant derivative of A al3 "' (with respect to a connection that is compatible 
with the spacetime metric g a p) to the covariant derivative of A ab "', defined in terms 
of a connection that is compatible with the induced metric h ab . For simplicity, we 
shall restrict our attention to the case of a tangent vector field A a , such that 

j± — -fi c a , ^i ;i a — u, — e a . 

Generalization to three-tensors of higher ranks will be obvious. 

We define the intrinsic covariant derivative of a three-vector A a to be the pro- 
jection of A a .p onto the hypersurface: 

A a \ b =A a , p e a a el- (3.4.3) 

We will show that A a \ b , as defined here, is nothing but the covariant derivative of 
A a defined in the usual way in terms of a connection r° 6c that is compatible with 

h>ab- 

To get started, let us express the right-hand side of Eq. (3.4.3) as 

A a .ee°4 = {A a e a a ).^ b - A a e a a y 



A a ,ps b — e a ^-pe b A°e1 

dAg dx p 7 p 
dx? dy b e ^ a ^ e b- 

A a ,b ~ ^cabA° , 



where we have defined 



Equation (3.4.3) then reads 



r ca 6 = e1e ai .pel- (3.4.4) 



A a]b = A atb - T c ab A c , (3.4.5) 

which is the familiar expression for the covariant derivative. 

The connection used here is the one defined by Eq. (3.4.4), and we would like 
to show that it is compatible with the induced metric. In other words, we would 
like to prove that r ca j, as defined by Eq. (3.4.4), can also be expressed as 

r co 6 = ^ {hca,b + h cb , a — /l 6,c) • (3.4.6) 

This could be done by directly working out the right-hand side of Eq. (3.4.4). It is 
easier, however, to show that the connection is such that h ab \ c = h a p ;ry e"e b e2 = 0. 
Indeed, 

h <*P;-y e a e b e c = (9a0 - en a np) n e°e^ 

= -£(n a;i np + n a np n ) e"e^ej 
= 0, 



because n a e" = 0. Intrinsic covariant differentiation is therefore the same operation 
as straightforward covariant differentiation of a three-tensor. 



3.4 Differentiation of tangent vector fields 



59 



3.4-3 Extrinsic curvature 

The quantities A a \ b = A a; pe%e b are the tangential components of the vector A a .pe b - 
The question we would like to investigate now is whether this vector possesses also 
a normal component. 

To answer this we re-express A a .pef as g^A^e^ and decompose the metric into 
its normal and tangential parts, as in Eq. (3.1.12). This gives 

A a . 4 = {en a nil + h am e a a e mll )A^el 

= e(n^y b K + h am (A^y b )e:, 

and we see that only the second term is tangent to the hypersurface. We now use 
Eq. (3.4.3) and the fact that A» is orthogonal to n M : 

A" ^1 = -e(n^A^ b )n a +h am A mlb e a a 
= A« lb e:-eA«(n^y b )n a . 

At this point we introduce the three-tensor 

K ab =n a ., p e a a el, (3.4.7) 

called the extrinsic curvature, or second fundamental form, of the hypersurface S. 
In terms of this, we have 

A a . p 4 = A a {b el - eA a K ab n a , (3.4.8) 

and we see that A a ^ b gives the purely tangential part of the vector field, while 
— sA a K ab represents the normal component. This answers our question: the normal 
component vanishes if and only if the extrinsic curvature vanishes. 

We note that if e" is substituted in place of A a , then A° = 6 c a and Eqs. (3.4.5), 
(3.4.8) imply 

C-A= TC a b e a c -eK ab n a . (3.4.9) 

This is known as the Gauss- Weingarten equation. 

The extrinsic curvature is a very important quantity; we will encounter it often 
in the rest of this book. We may prove that it is a symmetric tensor: 

K ba = K ab . (3.4.10) 

The proof is based on the properties that (i) the vectors e" and n a are orthogonal, 
and (ii) the basis vectors are Lie transported along one another, so that e" ;/3 ej = 
e 6;/3 e o- We have 

n a -/3e"e b = — n a e" i/3 ej 
= — n a ej.^e„ 
= n a;/3 e£e^, 

and Eq. (3.4.10) follows. The symmetry of the extrinsic curvature implies the 
relations 

K ab = n {a . lP) e%4 = \ {£ n g a p)e a a el (3.4.11) 

and the extrinsic curvature is therefore intimately related to the normal derivative 
of the metric tensor. 

We also note the relation 

K ee h ab K ab = n a . a , (3.4.12) 
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which shows that K is equal to the expansion of a congruence of geodesies that 
intersect the hypersurface orthogonally (so that their tangent vector is equal to n a 
on the hypersurface). From this result we conclude that the hypersurface is convex 
if K > (the congruence is diverging), or concave if K < (the congruence is 
converging) . 

We see that while h ab is concerned with the purely intrinsic aspects of a hyper- 
surface's geometry, K ab is concerned with the extrinsic aspects — the embedding 
of the hypersurface in the enveloping spacetime manifold. Taken together, these 
tensors provide a virtually complete characterization of the hypersurface. 



3.5 Gauss-Codazzi equations 

3. 5. 1 General form 

We have introduced the induced metric h ab and its associated intrinsic covariant 
derivative. A purely intrinsic curvature tensor can now be defined by the relation 

A c \ a b - A c \ ba = -R c dab A d , (3.5.1) 

which of course implies 

n dab — 1 db,a ~ 1 da,b "+" 1 ma L db ~ 1 mb L da- (O.O. ZJ 

The question we now examine is whether this three-dimensional Riemann tensor 
can be expressed in terms of -R 7 5a/3 — the four-dimensional version — evaluated on 
S. 

To answer this we start with the identity 

KA)-^c = (r d ab e a d -eK ab n a ) n e2 
which follows immediately from Eq. (3.4.9). We first develop the left-hand side: 

lhs = {e° P 4) n e l 

— P a pPp'y _i_ P a P P pi 

— e a;P-y e b e c + e o;/3 e 6; 7 e c 

= C^^c+CA^A-eKb^) 

= eleAel + T\ c {T\ d e a e - sK ad n a ) - eK bc e^. 

Next we turn to the right-hand side: 

rhs = {T d ab e a d -eK ab n a ). n eZ 

= ^ d ab,c e d + F d ab e d;-y e c ~ £ K a b^ c n a — eK a b'n a . ry e2 

= r d a6jC e2 + T d ab (r e dc e« - eK dc n a ) - eK ab , c n a - eK^eJ. 

We now equate the two sides and solve for e^.^e^ej. Subtracting a similar ex- 
pression for e a , . ryj3 e'Je b gives — i? a ^ e^e^e^, the quantity we are interested in. After 
some algebra, we find 

Watfa&l = R m a b A+<K ab \ c - K ac{b )n» + eK ab n^eZ - sK ac n»y b . 
Projecting along e dfi gives 

Rap-yS eZe b e1e 5 d = R abcd + e(K ad K bc - K ac K bd ), (3.5.3) 
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and this is the desired relation between R a bcd and the full Riemann tensor. Pro- 
jecting instead along n M gives 

R^n'eZeZel = K ab]c - K ac]b . (3.5.4) 

Equations (3.5.3) and (3.5.4) are known as the Gauss- Codazzi equations. They 
reveal that the spacetime curvature can be expressed in terms of the intrinsic and 
extrinsic curvatures of a hypersurface. 

3. 5. 2 Contracted form 

The Gauss-Codazzi equations can also be written in contracted form, in terms of 
the Einstein tensor G a p = R a p — \Rg a p. The spacetime Ricci tensor is given by 

Ra/3 = Q^Rpavfi 

= {en»n" +h mn e% l e" n )R liav p 

= eR^pnV + h mn R^ p e^ 

and the Ricci scalar is 

R = g aP R af } 

= (en a nP + h ab e^ b ) {eR mvf} n»n v + h mn R m ^e^ n ) 
= 2eh ab R liav ^e a a n v el + h ab h mn R mv ^ m e a a e v n el 

A little algebra then reveals the relations 

-2£G aP n a n p = 3 R + e (K ab K ab - K 2 ) (3.5.5) 

and 

G aP e a y = K b a{b -K ta . (3.5.6) 

Here, 3 R = h ab R m amb is the three-dimensional Ricci scalar. The importance of 
Eqs. (3.5.5) and (3.5.6) lies with the fact that they form part of the Einstein field 
equations on a hypersurface S; this observation will be elaborated in the next 
section. We note that G a p e"e b , the remaining components of the Einstein tensor, 
cannot be expressed solely in terms of h ab , K ab , and related quantities. 

3.5.3 Ricci scalar 

We now complete the computation of the four-dimensional Ricci scalar. Our starting 
point is the relation 

R = 2eh ab R liav ^e a a n v e p b + h ab h mn R^ av ^ m e^ n e^ 

which was derived previously. The first term is simplified by using the completeness 
relation (3.1.12) and the fact that R liav ^n il n a n v vP = 0; it becomes 2eR a ^n a vP . 
Using the definition of the Riemann tensor, we rewrite this as 

= -in a ,y), + n« a n% + (n a ,^), a - n W 

In the second term of this last expression we recognize K 2 , where K = n a . a is 
the trace of the extrinsic curvature. The fourth term, on the other hand, can be 
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expressed as 

= (en^n" + h?»)(En a n v + h av )n a .pn KV 

= {en p n» + h^)h a "n a .pn^ 

= h^h av n a -,pn^ v 

= h^h^n^e^n^e^ 

= h bm h an K ab K mn 

= K ab K ba 

= K ab K ab . 

In the second line we have inserted the completeness relation (3.1.12) and used the 
notation h a/3 = h ab e"e^. In the third and fourth lines we have used the fact that 
n a n a -jj = \{n a n a )-p = 0. In the sixth line we have substituted the definition (3.4.7) 
for the extrinsic curvature. Finally, in the last line we have used the fact that K ab 
is a symmetric three-tensor. 

The previous manipulations take care of the first term in our starting expression 
for the Ricci scalar. The second term is simplified by substituting the Gauss-Codazzi 
equation (3.5.3), 

h ab h mn R^ p e^ m e^y b = h ab h mn [R manb + e(K mb K an - K mn K ab )] 

= 3 R + e(K ab K ab -K 2 ). 

Putting all this together, we arrive at 

R = 3 R + e(K 2 - K ab K ab ) + 2e{n a . p n p - n a n p . ) . (3.5.7) 

This is the four-dimensional Ricci scalar evaluated on the hypersurface X. This 
result will be put to good use in Chapter 4. 

3.6 Initial-value problem 

3. 6. 1 Constraints 

In Newtonian mechanics, a complete solution to the equations of motion requires 
the specification of initial values for the position and velocity of each moving body. 
In field theories, a complete solution to the field equations requires the specification 
of the field and its time derivative at one instant of time. 

A similar statement can be made for general relativity. Because the Einstein 
field equations are second-order partial differential equations, we would expect that 
a complete solution should require the specification of g a p and g a p,t at one instant 
of time. While this is essentially correct, it is desirable to convert this decidedly 
noncovariant statement into something more geometrical. 

The initial-value problem of general relativity starts with the selection of a 
spacelike hypersurface S which represents an "instant of time" . This hypersurface 
can be chosen freely. On this hypersurface we put arbitrary coordinates y a . 

The spacetime metric g a p, when evaluated on S, has components that charac- 
terize displacements away from the hypersurface. (For example, gu is such a com- 
ponent if S is a surface of constant t.) These components cannot be given meaning 
in terms of the geometric properties of S alone. To provide meaningful initial values 
for the spacetime metric, we must consider displacements within the hypersurface 
only. In other words, the initial values for g a p can only be the six components of 
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the induced metric h a f, = g a /3e"e^; the remaining four components are arbitrary, 
and this reflects the complete freedom in choosing the spacetime coordinates x a . 

Similarly, the initial values for the "time derivative" of the metric must be 
described by a three-tensor that carries information about the derivative of the 
metric in the direction normal to the hypersurface. Because = ^(£ n g a p) e " e b' 
the extrinsic curvature is clearly an appropriate choice. 

The initial-value problem of general relativity therefore consists in specifying two 
symmetric tensor fields, /i f, and K^, on a spacelike hypersurface X. In the complete 
spacetime, /i f, is interpreted as the induced metric on the hypersurface, while 
is the extrinsic curvature. These tensors cannot be chosen freely: they must satisfy 
the constraint equations of general relativity. These are given by Eqs. (3.5.5) and 
(3.5.6), together with the Einstein field equations G a p = 87rT a/ 3: 

3 R + K 2 - K ab K ab = 16TrT aP n a n p = 16np (3.6.1) 

and 

K b a]b - K, a = SnTapey 13 = 8nj a . (3.6.2) 

The remaining components of the Einstein field equations provide evolution equa- 
tions for hat, and K^; these will be considered in Chapter 4. 

3.6.2 Cosmological initial values 

As an example, let us solve the constraint equations for a spatially flat, isotropic, 
and homogeneous cosmology. To satisfy these requirements, the three-metric must 
take the form 

ds 2 = a 2 {dx 2 + dy 2 + dz 2 ), 

where a is the scale factor, which is a constant on the hypersurface. Isotropy and 
homogeneity also imply p = constant, j a = 0, and 

K a b = - Kh a b, 

where K is a constant. The second constraint equation is therefore trivially satisfied. 
The first one implies 

167T/9 = K 2 - K ab K ab = ^K 2 , 

and this provides the complete solution to the initial-value problem. 

To recognize the physical meaning of this last equation, we use the fact that 
in the complete spacetime, K = n a . a , where n a is the unit normal to surfaces of 
constant t. The full metric is given by the Friedmann- Robertson- Walker form 

ds 2 = -dt 2 + a 2 (t)(dx 2 + dy 2 + dz 2 ), 

so that n a = —d a t and K = 3a/a, where an overdot indicates differentiation with 
respect to t. The first constraint equation is therefore equivalent to 

2>{a/a) 2 = 8irp, 

which is one of the Friedmann equations governing the evolution of the scale factor. 

3.6.3 Moment of time symmetry 

We notice from the previous example that = when a = 0, that is, the 
extrinsic curvature vanishes when the scale factor passes through a turning point of 
its evolution. Because the dynamical history of the scale factor is time-symmetric 
about the time t = to at which the turning point occurs, we may call this time a 
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moment of time symmetry in the evolution of the spacetime. Thus, K a f, = at this 
moment of time symmetry. 

Generalizing, we shall call any hypersurface £ on which K a i = a moment of 
time symmetry in spacetime. Because K a i is essentially the "time derivative" of the 
metric, a moment of time symmetry corresponds to a turning point of the metric's 
evolution, at which its "time derivative" vanishes. The dynamical history of the 
metric is then "time-symmetric" about S. From Eq. (3.6.2) we see that a moment 
of time symmetry can occur only if j a = on that hypersurface. 

3.6.4 Stationary and static spacetimes 

A spacetime is stationary if it admits a timelike Killing vector This means that 
in a coordinate system (t,x a ) in which £ a = 6 a t , the metric does not depend on the 
time coordinate t: g a p,t = (see Sec. 1.5). For example, a rotating star gives rise 
to a stationary spacetime if its mass and angular velocity do not change with time. 

A stationary spacetime is also static if the metric does not change under a time 
reversal, t — > —t. For example, the spacetime of a rotating star is not static because 
a time reversal changes the direction of rotation. In the specified coordinate system, 
invariance of the metric under a time reversal implies gt a =0. This, in turn, implies 
that the Killing vector is proportional to a gradient: £ a = gttd a t. Thus, a spacetime 
is static if the timelike Killing vector field is hypersurface orthogonal. 

We may show that if a spacetime is static, then = on those hypersurfaces 
£ that are orthogonal to the Killing vector; these hypersurfaces therefore represent 
moments of time symmetry. If £ is orthogonal to £ a , then its unit normal must given 
by n a = fi £ a , where 1/p 2 = This implies that n a;/3 = (i£ a; p +£ a /*„a, and 

n (a;/3) = ^(aA*,/3) because £ a is a Killing vector. That = follows immediately 
from Eq. (3.4.11) and the fact that £ a is orthogonal to e". 

3. 6. 5 Spherical space, moment of time symmetry 

As a second example, we solve the constraint equations for a spherically symmetric 
spacetime at a moment of time symmetry. The three-metric can be expressed as 

ds 2 = [l-2m(r)/r]~ 1 dr 2 +r 2 dtf, 

for some function m(r); to enforce regularity of the metric at r = we must impose 
m(0) = 0. The Ricci scalar is given by 3 R = 4m'/r 2 , with a prime denoting 
differentiation with respect to r. Because K a i = at a moment of time symmetry, 
Eq. (3.6.1) implies 16np = 3 R. Solving for m(r) gives 

m(r) = / 4nr' 2 p(r') dr' . 
Jo 

This states, loosely speaking, that m(r) is the mass-energy contained inside a sphere 
of radius r, at the selected moment of time symmetry. 

3. 6. 6 Spherical space, empty and flat 

We now solve the constraint equations for a spherically symmetric space empty of 
matter (so that p = = j a ). We assume that we can endow this space with a flat 
metric, so that 

h ab dy a dy b = dr 2 + r 2 dtl 2 . 

We also assume that the hypersurface does not represent a moment of time sym- 
metry. While the flat metric and = make a valid solution to the constraints, 
this is a trivial configuration — a flat hypersurface in a flat spacetime. 
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Let n a = d a r be a unit vector that points radially outward on the hypersurface. 
The fact that K a b is a spherically symmetric tensor means that it can be expressed 
as 

Kab = Ki(r)n a n b + K 2 (r)(h ab - n a n b ), 

with K\ representing the radial component of the extrinsic curvature, and K 2 the 
angular components. In the usual spherical coordinates (r,6,(f)), we have K a b = 
diag(JCi, K 2 , K 2 ), which is the most general expression admissible under the as- 
sumption of spherical symmetry. 

Because the space is empty and flat, the first constraint equation reduces to 
K 2 — K ab K ab = 0, an algebraic equation for Ki and K 2 . This gives us the condition 
(2Ki + K 2 )K 2 =0. Choosing K 2 = would eventually return the trivial solution 
K ab = 0. We choose instead K 2 = —2Ki and re-express the extrinsic curvature as 

Kab = K{r) Q h ab - n a n^j , 

where K = —3Ki is the sole remaining function to be determined. 

To find K (r), we turn to the second constraint equation, K b a ^ b — K, a = 0, which 
becomes 

i K, a + (Kn b lb + K, b n b )n a + Kn a]b n b = 0. 

With K^ a = K'n a (with a prime denoting differentiation with respect to r), n 6 | 6 = 
2/r, and n a \bn b = (because the radial curves are geodesies of the hypersurface), 
we arrive at 2rK' + 3K = 0. Integration yields 

K(r) = K (r/r f\ 

with K denoting the value of K at the arbitrary radius Tq. 

We have found a nontrivial solution to the constraint equations for a spherical 
space that is both empty and flat. The physical meaning of this configuration will 
be revealed in Sec. 3.13, Problem 1. 



3.6.7 Conformally-flat space 

A powerful technique for generating solutions to the constraint equations consists 
of writing the three-metric as 

Kb = tp 4 8 a b, 

where tp(y a ) is a scalar field on the hypersurface. Such a metric is said to be 
conformally related to the flat metric, and the space is said to be conformally flat. 
For this metric the Ricci scalar is 3 R = —8tp~ 5 S/ 2 tp, and Eq. (3.6.1) takes the form 
of Poisson's equation, 

V 2 V = -27r/9 eff , 

where 

P^=A P+ ^{ KabKab - K2 )} 

is an effective mass density on the hypersurface. At a moment of time symmetry, 
this simplifies to /9 e fr = ip 5 p, and one possible strategy for solving the constraint is 
to specify p e s, solve for tp, and then see what this produces for the actually mass 
density p. If p = at the moment of time symmetry, then the constraint becomes 
Laplace's equation V 2 V> = 0, and this admits many interesting solutions. A well- 
known example is Misner's (1960) solution, which describes two black holes about 
to undergo a head-on collision. This initial data set has been vigourously studied 
by numerical relativists. 
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3.7 Junction conditions and thin shells 

The following situation sometimes presents itself: A hypersurface £ partitions 
spacetime into two regions f + and V~ (Fig. 3.5). In 'f + the metric is g^, and 
it is expressed in a system of coordinates x". In f~ the metric is g~p, and it is 
expressed in coordinates x". We ask: what conditions must be put on the metrics 
to ensure that f + and ~f~ are joined smoothly at £, so that the union of and 
g~p forms a valid solution to the Einstein field equations? To answer this question 
is not entirely straightforward because in practical situations, the coordinate sys- 
tems x± will often be different, and it may not be possible to compare the metrics 
directly. To circumvent this difficulty we will endeavour to formulate junction con- 
ditions that involve only three-tensors on S. In this section we will assume that £ 
is either timelike or spacelike; we will return to the case of a null hypersurface in 
Sec. 3.11. 

3. 7. 1 Notation and assumptions 

We assume that the same coordinates y a can be installed on both sides of the 
hypersurface, and we choose n a , the unit normal to S, to point from f~ to f + . 
We suppose that an overlapping coordinate system x a , distinct from x±, can be 
introduced in a neighbourhood of the hypersurface. (This is for our short-term 
convenience; the final formulation of the junction conditions will not involve this 
coordinate system.) We imagine £ to be pierced by a congruence of geodesies that 
intersect it orthogonally. We take £ to denote proper distance (or proper time) 
along the geodesies, and we adjust the parameterization so that £ = when the 
geodesies cross the hypersurface; our convention is that £ is negative in f~ and 
positive in f + . We can think of £ as a scalar field: The point P characterized by 
the coordinates x a is linked to £ by a member of the congruence, and £(x a ) is the 
proper distance (or proper time) from £ to P along this geodesic. Our construction 
implies that n a is equal to dx a /d£ at the hypersurface, and that 

n a = ed a £; (3.7.1) 

we also have n a n a = e. 

We will use the language of distributions. We introduce the Heaviside distribu- 
tion Q(£), equal to +1 if £ > 0, if £ < 0, and indeterminate if £ = 0. We note the 
following properties: 

6 2 (f) = e(£), 0(£)0(-£) = O, ±Q(£) = 6(£), 

where 5{£) is the Dirac distribution. We also note that the product Q(£)6(£) is not 
defined as a distribution. 




Figure 3.5: Two regions of spacetime joined at a common boundary. 
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The following notation will be useful: 

[A]=A(y+)\ s -A(y-)\ s , 

where A is any tensorial quantity defined on both sides of the hypersurface; [A] is 
therefore the jump of A across S. We note the relations 

[n a ] = [e:]=0, (3.7.2) 

where e" = dx a /dy a . The first follows from the relation n a = dx a /d£ and the 
continuity of both I and x a across S; the second follows from the fact that the 
coordinates y a are the same on both sides of the hypersurface. 

3. 7. 2 First junction condition 

We begin by expressing the metric g a p, in the coordinates distribution- 
valued tensor: 

9 a p = Q{t)9ie + Q{-t)9-p, (3.7.3) 

where g^p is the metric in Y ± expressed in the coordinates x a . We want to know if 
the metric of Eq. (3.7.3) makes a valid distributional solution to the Einstein field 
equations. To decide, we must verify that geometrical quantities constructed from 
g a p, such as the Riemann tensor, are properly defined as distributions. We must 
then try to eliminate, or at least give an interpretation to, singular terms that might 
arise in these geometric quantities. 
Differentiating Eq. (3.7.3) yields 

<?a/3, 7 = &(£) gtp n + &(-£) g~ Pn + eS(l) [<?«/?]%, 

where Eq. (3.7.1) was used. The last term is singular, and it causes problems when 
we compute the Christoffel symbols, because it generates terms proportional to 
Q(£)5(£). If the last term were allowed to survive, therefore, the connection would 
not be defined as a distribution. To eliminate this term, we impose continuity 
of the metric across the hypersurface: [g a p] = 0. This statement holds in the 
coordinate system x a only. However, we can easily turn this into a coordinate- 
invariant statement: = [g a p]e"e^ = [g a pe"e^]; this last step follows by virtue of 
Eq. (3.7.2). We have obtained 

[hob] = 0, (3.7.4) 

the statement that the induced metric must be the same on both sides of S. This 
is clearly required if the hypersurface is to have a well-defined geometry. Equation 
(3.7.4) will be our first junction condition, and it is expressed independently of the 
coordinates x a or x±. 

3. 7. 3 Riemann tensor 

To find the second junction condition requires more work: we must calculate the 
distribution-valued Riemann tensor. Using the results obtained thus far, we have 
that the Christoffel symbols are 

r^ = 0(i)r + ; 7 + 0(-i)r-; 7 , 

where T^" are the Christoffel symbols constructed from g^. A straightforward 
calculation then reveals 

rv 4 = em r+° 7i4 + e(-f) r-£ i4 + ed{i) [r^ 7 K 
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and from this follows the Riemann tensor: 

R a Pl5 = 0(f) R+ a lS + 0(-f) RJ lS + 5{i)A^ lS , (3.7.5) 

where 

A a w = e([r%]n 7 - [T%]n e ). (3.7.6) 

We see that the Riemann tensor is properly defined as a distribution, but the 
(^-function term represents a curvature singularity at S. Our second junction con- 
dition will seek to eliminate this term. Failing this, we will see that a physical 
interpretation can nevertheless be given to the singularity. This is our next topic. 

3. 7.4 Surface stress- energy tensor 

Although they are constructed from Christoffel symbols, the quantities A a p s form 
a tensor, because the difference between two sets of Christoffel symbols is a tensorial 
quantity (see Sec. 1.2). We must now find an explicit expression for this tensor. 

The fact that the metric is continuous across £ in the coordinates x a implies 
that its tangential derivatives must also be continuous. This means that if g a p,-y is 
to be discontinuous, the discontinuity must be directed along the normal vector n a . 
There must therefore exist a tensor field K a p such that 

[Sa0, 7 ] = K a/3 n 7 ; (3.7.7) 

this tensor is given explicitly by 

n a p = ^[gap^n 1 . (3.7.8) 

Equation (3.7.7) implies 

[ T %] = \ ( K "/3% + " a 7 «i3 - KpryTl"), 

and we obtain 

A a p jS = |(K°5ngn 7 - K a ^npns - Kpsn a rij + Kpjn a ns). 

This is the 6- function part of the Riemann tensor. 

Contracting over the first and third indices gives the 5-function part of the Ricci 
tensor: 

A a p = A» a ^ = - {K^n^np + K^,pn^n a - nn a np - en a p), 

where k = K a a . After an additional contraction we obtain the 5-function part of 
the Ricci scalar, 

A = A a a = e{K liV n ii n v - en). 

With this we form the 5-function part of the Einstein tensor, and after using the 
Einstein field equations, we obtain an expression for the stress-energy tensor: 

T aP = 0(£) T+ p + 0(-£) T~p + 5(£)S a p, (3.7.9) 

where 87r5 a/ 3 = A a p — \Ag a p. In Eq. (3.7.9), the first and second terms represent 
the stress-energy tensors of regions Y + and Y~ , respectively. The 5-function term, 
on the other hand, comes with a clear interpretation: it is associated with the 
presence of a thin distribution of matter — a surface layer, or a thin shell — at S; 
this thin shell has a surface stress-energy tensor given by S a p- 
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3.7.5 Second junction condition 

Explicitly, the surface stress-energy tensor is given by 

16neS a/ 3 = + K^n^ria - nn a n^ - £K a $ - (n^n^n" - £n)g a/ 3. 

From this we notice that S a p is tangent to the hypersurface: Sapn 13 = 0. It therefore 
admits the decomposition 

S"? = S ab e«e p b , (3.7.10) 
where S ab = S aj 3e"ef is a symmetric three-tensor. This is evaluated as follows: 

167r5 a6 = -Kape^el - e^^n^n" - £K)h ab 

= -n aP e a y b - Kliv {gv - h mn e^ n )h ab + nh ab 

— —k aP a pP -\-h mn K P V h i 

On the other hand, we have 

= \ ( £K «/3 - K ia n f} n' y - K 1 pn a n' y ) , 

which allows us to write 

[K ab ] = [n a;/3 ]e£ef = £ -n al3 e«el. 

Combining these results, we obtain 

Sab = ([K ab ] ~ [K]h ab ), (3.7.11) 

which relates the surface stress-energy tensor to the jump in extrinsic curvature 
from one side of £ to the other. The complete stress-energy tensor of the surface 
layer is 

T^f = S(£) S ab e"ef. (3.7.12) 

We conclude that a smooth transition across £ requires [K ab ] = — the extrin- 
sic curvature must be the same on both sides of the hypersurface. This require- 
ment does more than just remove the 5-function term from the Einstein tensor: In 
Sec. 3.13, Problem 4 you will be asked to prove that [K ab ] = implies A a p s = 0, 
which means that the full Riemann tensor is then nonsingular at S. 

The condition [K ab ] = is our second junction condition, and it is expressed 
independently of the coordinates x a and x±. If this condition is violated, then 
the spacetime is singular at S, but the singularity comes with a straightforward 
interpretation: a surface layer with stress-energy tensor T^P is present at the hy- 
persurface. 

Notice that a minor miracle is at work here: When [K ab ] ^ 0, only the Ricci 
part of the Riemann tensor acquires a singularity, and it is this part that can readily 
be associated with matter. The remaining part of the Riemann tensor — the Weyl 
part — is smooth even when the extrinsic curvature is discontinuous. 

3. 7. 6 Summary 

The junction conditions for a smooth joining of two metrics at a hypersurface £ 
(assumed not to be null) are 

[h ab ] = [K ab ] = 0. 
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If the extrinsic curvature is not the same on both sides of £, then a thin shell with 
surface stress-energy tensor 



Sab = -^([K ab ] ~ [K]h ab ) 



is present at S. The complete stress-energy tensor of the surface layer is given by 
Eq. (3.7.12) in the overlapping coordinates x a . In the coordinate system x± used 
originally in y ± , it is 

This follows from Eq. (3.7.12) by a simple coordinate transformation from x a to 
x±; such a transformation leaves both I and S ab invariant. 

This formulation of the junction conditions is due to Darmois (1927) and Israel 
(1966). The thin-shell formalism is due to Lanczos (1922 and 1924) and Israel 
(1966); an extension to null hypersurfaces will be considered in Sec. 3.11. 



3.8 Oppenheimer-Snyder collapse 

In 1939, J. Robert Oppenheimer and his student Hartland Snyder published the first 
solution to the Einstein field equations that describes the process of gravitational 
collapse to a black hole. For simplicity, they modeled the collapsing star as a 
spherical ball of pressureless matter with a uniform density. (A perfect fluid with 
negligible pressure is usually called dust.) The metric inside the dust is a Friedmann- 
Robertson- Walker (FRW) solution, while the metric outside is the Schwarzschild 
solution (Fig. 3.6). The question considered here is whether these metrics can be 
joined smoothly at their common boundary, the surface of the collapsing star. 
The metric inside the collapsing dust (which occupies the region Y~) is given 

by 

ds 2 _ = -dr 2 + a 2 (r)(dx 2 + sin 2 X d£l 2 ), (3.8.1) 

where r is proper time on comoving world lines (along which x> 8, and (f> are all 
constant), and a(r) is the scale factor. By virtue of the Einstein field equations, 
this satisfies 




(3.8.2) 



Figure 3.6: The Oppenheimer-Snyder spacetime. 
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where an overdot denotes differentiation with respect to r. By virtue of energy- 
momentum conservation in the absence of pressure, the dust's mass density p sat- 
isfies 

3 

pa 3 = constant = — a max , (3.8.3) 

where a max is the maximum value of the scale factor. The solution to Eqs. (3.8.2) 
and (3.8.3) has the parametric form 

a(r)) = |a max (l + cosf?), t(t)) = |o max (?? + sin 77); 

the collapse begins at r} = when a = a max , and it ends at r} = n when a = 0. The 
hypersurface £ coincides with the surface of the collapsing star, which is located at 
X = Xo in our comoving coordinates. 

The metric outside the dust (in the region f + ) is given by 

ds 2 + = -fdt 2 + f~ 1 dr 2 +r 2 dn 2 , f = 1 - 2M/r, (3.8.4) 

where M is the gravitational mass of the collapsing star. As seen from the outside, £ 
is described by the parametric equations r = R(t), t = T(r), where r is proper time 
for observers comoving with the surface. Clearly, this is the same r that appears in 
the metric of Eq. (3.8.1). 

It is convenient to choose y a = (r, #,</>) as coordinates on S. This implies that 
e" = u a , where u a is the four-velocity of an observer comoving with the surface of 
the collapsing star. 

We now calculate the induced metric. As seen from f~, the metric on £ is 

ote| = — dr 2 + a 2 (r) sin 2 xo d£l 2 . 

As seen from on the other hand, 

ds% = -(FT 2 - F^R 2 ) dr 2 + R 2 (t) dti 2 , 

where F = 1 — 2M/R. Because the induced metric must be the same on both sides 
of the hypersurface, we have 

R( T ) = (r) sin xo, FT 2 - F^R 2 = 1. (3.8.5) 

The first equation determines R(t), and the second equation can be solved for T: 

FT =VR 2 +F ee P(R,R). (3.8.6) 

This equation can be integrated for T(r), and the motion of the boundary in f + 
is completely determined. 

The unit normal to S can be obtained from the relations n a u a = 0, n a n a = 1. 
As seen from f~, u°L d a = d T and n~ dx a = adx; we have chosen n x > so that 
n a is directed toward ~f + . As seen from f + , u"d a =Td t +Rd r and n+dx a = 
—Rdt + Tdr, with a consistent choice for the sign. 

The extrinsic curvature is defined on either side of S by = n a; pe"e^. The 
nonvanishing components are K TT = n a -pu a u^ = —n a u a . j3 u 13 = —a a n a (where a a 
is the acceleration of an observer comoving with the surface), Kqq = ng-g, and 
K <)><)> = Ti0;0- A straightforward calculation reveals that as seen from Y~ , 

KL T = 0, K e _ g = = a- 1 cot X o- (3.8.7) 

The first relation follows immediately from the fact that the comoving world lines 
of a FRW spacetime are geodesies. As seen from f + , 

K\ T = $/R, K% = K* = P/R, (3.8.8) 
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where P(R,R) is defined by Eq. (3.8.6). 

To have a smooth transition at the surface of the collapsing star, we demand 
that K a i, be the same on both sides of the hypersurface. It is therefore necessary 
for u" to satisfy the geodesic equation (a" = 0) in f + . It is easy to check that the 
geodesic equation implies R 2 + F = E 2 , where E = — u t is the (conserved) energy 
parameter of the comoving observer. This relation implies P = E, and the fact that 
/3 is a constant enforces K+ T = 0, as required. On the other hand, [K e e ] = implies 

cotxo/a = 0/R = E/(asmxo), or 

P = E = cosxo- (3.8.9) 



We have found that the requirement for a smooth transition at £ is that the hyper- 
surface be generated by geodesies of both ~f~ and and that the parameters E 
and xo be related by Eq. (3.8.9). With the help of Eqs. (3.8.2), (3.8.5), and (3.8.6), 
we may turn Eq. (3.8.9) into 

47T 

M = —pR 3 , (3.8.10) 
o 

which equates the gravitational mass of the collapsing star to the product of its 
density and volume. This relation has an immediate intuitive meaning, and it 
neatly summarizes the complete solution to the Oppenheimer-Snyder problem. 



3.9 Thin-shell collapse 

As an application of the thin-shell formalism, we consider the gravitational collapse 
of a thin spherical shell. We assume that spacetime is flat inside the shell (in 
Outside (in the metric is necessarily a Schwarzschild solution (by virtue of the 
assumed spherical symmetry) . We assume also that the shell is made of pressureless 
matter, in the sense that its surface stress-energy tensor is constrained to have the 
form 

S ab = au a u b , (3.9.1) 

in which a is the surface density and u a = dy a /dr the shell's velocity field. Our 
goal is to derive the shell's equations of motion under the stated conditions. 
Using the results derived in the preceding section, we have 

K T ±T = P ± /R, 
K±e = K^ = P ± /R, 
P+ = \Jh 2 + l-2M/R, 

P- = Vr 2 + 1, 



where R(t) is the shell's radius, and M its gravitational mass. As we did before, 
we use {t,6,4>) as coordinates on S. Equation (3.7.11) allows us to calculate the 
components of the surface stress-energy tensor, and we find 



4ttR ' " 8ttR 8ttR 

The second equation can be integrated immediately, giving (/3 + — P-)R = constant. 
Substituting this into the first equation yields 4nR 2 a = —constant. 
We have obtained 

A.-kR 2 o = m = constant (3.9.2) 
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and /3_ — /?+ = m/R. The first equation states that ra, the shell's rest mass, stays 
constant during the evolution. Squaring the second equation converts it to 

i : — ff) 2 

M = mVl+R 2 - — , (3.9.3) 
2R 

which comes with a nice physical interpretation. The first term on the right-hand 
side is the shell's relativistic kinetic energy, including rest mass. The second term is 
the shell's binding energy, the work required to assemble the shell from its dispersed 
constituents. The sum of these is the total (conserved) energy, and this is equal to 
the shell's gravitational mass M. Equation (3.9.3) provides a vivid illustration of 
the general statement that all forms of energy contribute to the total gravitational 
mass of an isolated body. 

Equations (3.9.2) and (3.9.3) are the shell's equations of motion. It is interesting 
to note that if M < m, then the motion exhibits a turning point at R = i? ma x = 
m 2 /[2(m— M)]: an expanding shell with M < m cannot escape its own gravitational 
pull. 



3.10 Slowly rotating shell 

Our next application of the thin-shell formalism is concerned with the spacetime of a 
slowly rotating, spherical shell. We take the exterior metric to be the slow-rotation 
limit of the Kerr solution, 

ds 2 , = -/ dt 2 + J' 1 dr 2 + r 2 dti 2 - — sin 2 6 dtdcf,. (3.10.1) 

r 

Here, / = 1 — 2M/r, with M denoting the shell's gravitational mass, and a = 
J/M <C M, where J is the shell's angular momentum. Throughout this section we 
will work consistently to first order in a. 

The metric of Eq. (3.10.1) is cut off at a radius r = R at which the shell is 
located. As viewed from the exterior, the shell's induced metric is 

ds% = -(1 - 2M/R) dt 2 + R 2 dn 2 - sin 2 6 dtdcp. 

R 

It is possible to remove the off-diagonal term by going to a rotating frame of refer- 
ence. We therefore introduce a new angular coordinate tp related to <p by 

ip = 4>-nt, (3.10.2) 

where fi is the angular velocity of the new frame with respect to the inertial frame 
of Eq. (3.10.1). We anticipate that fi will be proportional to a, and this allows us 
to approximate d(f> 2 by dtp 2 +2Qdtdtp. Substituting this into ds% returns a diagonal 
metric if fi is chosen to be 

(3-10-3) 

With this, the induced metric becomes 

h ab dy a dy b = -(1 - 2M/R) dt 2 + R 2 (d6 2 + sin 2 6 dtp 2 ). (3.10.4) 

It is now clear that the shell has a spherical geometry. As Eq. (3.10.4) indicates, 
we will use the coordinates y a = (t, 6, tp) on the shell. 

We take spacetime to be flat inside the shell, and we write the Minkowski metric 
in the form 



ds 2 _ = -(1 - 2M/R) dt 2 + dp 2 + p 2 {d6 2 + sin 2 6 dip 2 ), (3.10.5) 
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where p is a radial coordinate. This metric must be cut off at p = R and matched 
to the exterior metric of Eq. (3.10.1). The shell's intrinsic metric, as computed from 
the interior, agrees with Eq. (3.10.4). Continuity of the induced metric is therefore 
established, and we must now turn to the extrinsic curvature. 

We first compute the extrinsic curvature as seen from the shell's exterior. In the 
metric of Eq. (3.10.1), the shell's unit normal is n a = f~^^ 2 d a r. The parametric 
equations of the hypersurface are t = t, 6 = 6, and <j> = tp + fit, and they have 
the generic form x a = x a (y a ). These allow us to compute the tangent vectors 
e" = dx a /dy a , and we obtain efd a = dt + fi<9</>, egd a = de, and e^d a = 0$. From 
all this we find that the nonvanishing components of the extrinsic curvature are 



M 



R 2 y/l-2M/R' 

3Ma sin 2 6 
~ R 2 ^1-2M/W 

K% = ^^2-MjR, 



K\ = 



^l-2M/R = K%. 



As now seen from the shell's interior, the unit normal is n a = d a p, and the tangent 
vectors are efd a = dt, egd a = de, and e°^d a = d^. From this we find that K 9 e = 

1/R = are the only two nonvanishing components of the extrinsic curvature. 
This could have been obtained directly by setting M = in our previous results. 

We have a clear discontinuity in the extrinsic curvature, and Eq. (3.7.11) allows 
us to calculate S ab , the shell's surface stress-energy tensor. After a few lines of 
algebra, we obtain 

S\ = -^(l-VT^M/R), 



at 



3Masm 2 9 
8irR 2 ^/T^2MJE' 



s e = l-M/R-y/l-2M/R _ 
9 8nRy/l - 2M/R r 

These results, while giving us a complete description of the surface stress-energy 
tensor, are not terribly illuminating. Can we make sense of this mess? 
We will attempt to cast S ab in a perfect-fluid form, 

S ab = <iu a u b + p(h ab + u a u b ), (3.10.6) 

in terms of a velocity field u a , a surface density a, and a surface pressure p. How do 
we find these quantities? First we notice that Eq. (3.10.6) implies S a b u b = —au a , 
which shows that u a is a normalized eigenvector of the surface stress-energy tensor, 
with eigenvalue —a. This gives us three equations for three unknowns, the density 
and the two independent components of the velocity field. Once those have been 
obtained, the pressure is found by projecting S ab in the directions orthogonal to u a . 
The rest is just a matter of algebra. 

We can save ourselves some work if we recognize that the shell must move rigidly 
in the ip direction, with a uniform angular velocity ui. Its velocity vector can then 
be expressed as 

w a = 7 (r+wV°), (3.10.7) 



3.10 Slowly rotating shell 



75 



where t a = dy a /dt and tp a = dy a /dip are the Killing vectors of the induced metric 
h a b. In Eq. (3.10.7), w = dtp/dt is the shell's angular velocity in the rotating frame 
of Eq. (3.10.2), and 7 is determined by the normalization condition, h a iu a u b = — 1. 
We can simplify things further if we anticipate that w will be proportional to a. For 
example, neglecting 0(w 2 ) terms when normalizing u a gives 

1 (3.10.8) 



y/l-2M/R 

With these assumptions, we find that the eigenvalue equation produces ui = 
~S%/(~S t t + an d °~ = — After simplification, the first equation becomes 

6Ma 1-2M/R 

u=-ftt-, ; , — , 3.10.9 

R (l-y/l- 2M/R) (1 + 3^/1 - 2M/R) 

and the second is 

a = 4^R i 1 ~ V 1 - 2M / R ) ■ (3.10.10) 

We now have the surface density and the velocity field. The surface pressure can 
easily be obtained by projecting S ab in the directions orthogonal to ti°: p = |(/i & + 
u a u b )S ab = |(5 + a), where S = h ab S ab . This gives p = S%, and 

1 - MIR - J\ - 2M/R 

p= '- — ; v ' — . 3.10.11 

8ttR^1 - 2M/R 

The shell's surface stress-energy tensor can therefore be given an interpretation in 
terms of a perfect fluid of density a, pressure p, and angular velocity ui. When 
R is much larger than 2M, Eqs. (3.10.9)-(3.10.11) reduce to w ~ 3a/(2R 2 ), a ~ 
M/(4nR 2 ), and p ~ M 2 /(16ttR 3 ), respectively. 

The spacetime of a slowly rotating shell offers us a unique opportunity to explore 
the rather strange relativistic effects associated with rotation. We will conclude this 
section with a short description of these effects. 

The metric of Eq. (3.10.1) is the metric outside the shell, and it is expressed 
in a coordinate system that goes easily into a Cartesian frame at infinity. This is 
the frame of the "fixed stars" , and it is this frame which sets the standard of no 
rotation. The metric of Eq. (3.10.5), on the other hand, is the metric inside the 
shell, and it is expressed in a coordinate system that is rotating with respect to the 
frame of the fixed stars. The transformation is given by Eq. (3.10.2), and it shows 
that an observer at constant tp moves with an angular velocity dcji/dt = fi. Inertial 
observers inside the shell are therefore rotating with respect to the fixed stars, with 
an angular velocity S7 in = fi. According to Eq. (3.10.3), this is 

n in = (3.10.12) 

This angular motion is induced by the rotation of the shell, and the effect is known 
as the dragging of inertial frames. It was first discovered in 1918 by Thirring and 
Lense. 

The shell's angular velocity w, as computed in Eq. (3.10.9), is measured in the 
rotating frame. As measured in the nonrotating frame, the shell's angular velocity 
is fi she ii = d<t>/dt = dtp/dt + Q, = w + fi in . According to Eqs. (3.10.9) and (3.10.12), 
this is 

_ 2Ma 1 +2^/1 -2M/R 

"she.. - -£3- {1 _ V1 _ 2M/R){1 + 3V1 _ 2M/R y ( 3 - 10 - 13 ) 
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When R is much larger than 2M, fii n /fi s heii — 4M/(3R), and the internal ob- 
servers rotate at a small fraction of the shell's angular velocity. As R approaches 
2M, however, the ratio approaches unity, and the internal observers find themselves 
corotating with the shell. This is a rather striking manifestation of frame dragging. 
(The phrase "Mach's principle" is often attached to this phenomenon.) This space- 
time, admittedly, is highly idealized, and we may wonder whether corotation could 
ever occur in realistic situations. We will see in Chapter 5 that the answer is yes: 
a very similar phenomenon occurs in the vicinity of a rotating black hole. 



3.11 Null shells 

We saw in Sees. 3.1 and 3.2 that the description of null hypersurfaces requires some 
care and involves interesting subtleties, and we should not be surprised to find that 
the same is true of the description of null surface layers. Our purpose here, in the 
last section of Chapter 3, is to face these subtleties and extend the formalism of thin 
shells, as developed in Sec. 3.7, to the case of a null hypersurface. The presentation 
given here is adapted from Barrabes and Israel (1991). 



3.11.1 Geometry 

As in Sec. 3.7 we consider a hypersurface £ that partitions spacetime into two 
regions i /± in which the metric is when expressed in coordinates x±. Here we 
assume that the hypersurface is null, and our convention is such that ~f~ is in the 
past of S, and ~f + in its future. We assume also that the hypersurface is singular, 
in the sense that the Riemann tensor possesses a 6- function singularity at £. We 
will characterize the Ricci part of this singular curvature tensor, and relate it to the 
surface stress-energy tensor of the shell. 

(We note in passing that the Weyl part of the curvature tensor may also be 
singular at the hypersurface, and that this may happen even in the absence of a 
singular Ricci tensor: the two types of singularity are entirely decoupled. This 
property is peculiar to null hypersurfaces: in the case of timelike or spacelike shells, 
the curvature singularity is completely supported by the Ricci tensor, and the Weyl 
tensor is always smooth. In this section we shall be concerned only with the sin- 
gularity in the Ricci tensor, and we shall have to leave unexplored the interesting 
physical effects associated with a singular Weyl tensor.) 

As in Sec. 3.1.2 we install coordinates 

y a = {\6 A ) 

on the hypersurface, and as in Sec. 3.7.1 we assume that these coordinates are 
the same of both sides of S. We take A to be an arbitrary parameter on the null 
generators of the hypersurface, and we use 6 A to label the generators. We shall see 
below that in general, it is not possible to make A an affine parameter on both sides 
of £. 

In Y ± , £ is described by the parametric relations x±(y a ), and we can intro- 
duce tangent vectors e± = dx±/dy a on each side of the hypersurface. These are 
naturally separated into a null vector k± that is tangent to the generators, and 
two spacelike vectors e± A that point in the directions transverse to the generators. 
Explicitly, 

(Here and below, in order to keep the notation simple, we refrain from using the "±" 
label in displayed equations; this should not cause any confusion.) By construction, 
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these vectors satisfy 

k a k a = = k a e a A . (3.11.2) 
On the other hand, the remaining inner products 

a AB (X,e c ) =g a pe a A e% (3.11.3) 

do not vanish, and we assume that they are the same on both sides of S: 

[(tab] = 0. (3.11.4) 

As in Sec. 3.1.3, we find that it is the two-tensor a AB which acts as a metric on £, 

dsl=a AB de A d6 B , 

and the condition (3.11.4) ensures that the hypersurface possesses a well-defined 
intrinsic geometry. 

As in Sec. 3.1.3 we complete the basis by adding an auxiliary null vector N± 
satisfying 

N a N a = 0, N a k a = -1, N a e a A = 0. (3.11.5) 
This gives us the convenient expression 



g aP = _ k a N P _ N a k P + a ^ e « e ^ (3.11.6) 

for the inverse metric on either side of £ (in the coordinates x±); a AB is the inverse 
of a A s, and it is the same on both sides. 

To complete the geometric setup we must introduce a congruence of geodesies 
that cross the hypersurface. In Sec. 3.7, in which £ was either timelike or space- 
like, the congruence was selected by demanding that the geodesies intersect the 
hypersurface orthogonally: the vector field u± tangent to the congruence was set 
equal (on £) to the normal vector n±. When the hypersurface is null, however, this 
requirement does not produce a unique congruence, because a vector orthogonal to 
k a can still possess an arbitrary component along k a . 

We shall have to give up on the idea of adopting a unique congruence. An 
important aspect of our description of null shells is therefore that it involves an 
arbitrary congruence of timelike geodesies intersecting £. This arbitrariness comes 
with the lightlike nature of the singular hypersurface, and it cannot be removed. 
It can, however, be physically motivated: The arbitrary vector field u± that enters 
our description of null shells can be identified with the four-velocity of a family 
of observers making measurements on the shell; since many different families of 
observers could be introduced to make such measurements, there is no reason to 
demand that the vector field be uniquely specified. 

We therefore introduce a congruence of timelike geodesies 7 that arbitrarily 
intersect the hypersurface. The geodesies are parameterized by proper time r, 
which is adjusted so that r = when a geodesic crosses £; thus, r < in Y~ , and 
r > in y + . The vector tangent to the geodesies is 

u a = =U (3.11.7) 

UT 

To ensure that the congruence is smooth at the hypersurface, we demand that u± 
be "the same" on both sides of £. This means that u a e", the tangential projections 
of the vector field, must be equal when evaluated on either side of the hypersurface: 

[-u a k a ] = = [u Q eS]. (3.11.8) 

If, for example, u" is specified in Y~ , then the three conditions (3.11.8) are suf- 
ficient (together with the geodesic equation) to determine the three independent 
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components of u+ in f + . We note that —u a N a , the transverse projection of the 
vector field, is allowed to be discontinuous at S. 

The proper-time parameter on the timelike geodesies can be thought of as a 
scalar field t{x±) defined in a neighbourhood of S: Select a point x± off the hyper- 
surface and locate the unique geodesic 7 that connects this point to S; the value of 
the scalar field at x± is equal to the proper-time parameter of this geodesic at that 
point. The hypersurface £ can then be defined by the statement 

T(x a ) = 0, 

and the normal vector k+ will be proportional to the gradient of t{x±) evaluated 
at S. It is easy to check that the expression 

8t 

k a = -(-k,u»)— (3.11.9) 

is compatible with Eq. (3.11.7). We recall that the factor — in Eq. (3.11.9) is 
continuous across S. 

3.11.2 Surface stress-energy tensor 

As in Sec. 3.7 we introduce an overlapping coordinate system x a , distinct from 
x±, in a neighbourhood of the hypersurface; the final formulation of our null-shell 
formalism will be independent of these coordinates. We express the metric as a 
distribution-valued tensor: 

9af3 = ©(t) 9+ + 0(-t) g~0, 

where (x M ) is the metric in We assume that in these coordinates, the metric 
is continuous at S: [g a p] = 0; Eq. (3.11.4) is compatible with this requirement. We 
also have [k a ] = [ej] = [N a ] = [u a ] = 0. Differentiation of the metric proceeds as in 
Sees. 3.7.2 and 3.7.3, except that we now write r instead of t, and we use Eq. (3.11.9) 
to relate the gradient of r to the null vector k a . We arrive at a Riemann tensor 
that contains a singular part given by 

Rv%t = -(-*„«") _1 ([r%]fer - [T%]ks)S( T ), (3.11.10) 

where [r a ^ ] is the jump in the Christoffel symbols across S. 

In order to make Eq. (3.11.10) more explicit we must characterize the discon- 
tinuous behaviour of g a p,-y- The condition [g a p] = guarantees that the tangential 
derivatives of the metric are continuous: 

[gaPnW = = [Sa/3,7] e c- 

The only possible discontinuity is therefore in gap^N 1 , the transverse derivative of 
the metric, and we conclude that there must exist a tensor field ^ a such that 

[g a p n ] = -lapk-y. (3.11.11) 

This tensor is given explicitly by -y a = [g a ,a, 7 ]-ZV 7 , and it is now easy to check that 

[^7] = -\b a p*i +7 7 fc /3 -7*7* a )- (3-H-12) 
Substituting this into Eq. (3.11.10) gives 



R* a MS = 1 (7W7 - 7/35^7 - l^pks + lp-yk a k s )6(r), (3.11.13) 
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and we see that k a and j a /3 give a complete characterization of the singular part of 
the Riemann tensor. 

From Eq. (3.11.13) it is easy to form the singular part of the Einstein tensor, 
and the Einstein field equations then give us the singular part of the stress-energy 
tensor: 

Tf = (-JfeX) -1 S a ? S(t), (3.11.14) 

where 

S a/3 = J- (^7^^ + fc^"*" - 7"„* a * /S - l^k v g aP ) 

is the surface stress-energy tensor of the null shell — up to a factor —k^u^ 1 that 
depends on the choice of observers making measurements on the shell. Its expression 
can be simplified if we decompose S"* 3 in the basis (k a , e A , N a ). For this purpose 
we introduce the projections 

1a = lap elk 13 , -jab = lap e A e%, (3.11.15) 

and we use the completeness relation (3.11.6) to find that the vector 7^ ( fc' i admits 
the decomposition 

7> M = - v AB lA B )k a + {a AB lB ) e a A - (^k»k")N a . 

Substituting this into our previous expression for S a ^ and involving once more the 
completeness relation, we arrive at our final expression for the surface stress-energy 
tensor: 

S ap = nk a k p + j A (k a e A + e a A k p ) + pu AB e a A e%. (3.11.16) 

Here, 

can be interpreted as the shell's surface density, 



as a surface current, and 



as a surface pressure. 

The surface stress-energy tensor of Eq. (3.11.16) is expressed in the overlapping 
coordinates x a . As a matter of fact, the derivation of Eq. (3.11.16) relies heavily on 
these coordinates: the introduction of rests on the fact that in these coordinates, 
g a p is continuous at S, so that an eventual discontinuity of the metric derivative 
must be directed along k a . In the next subsection we will remove the need to 
involve the coordinates x a in practical applications of the null-shell formalism. For 
the time being we simply note that while Eq. (3.11.16) is indeed expressed in the 
overlapping coordinates x a , it is a tensorial equation involving vectors (k a and 
e A ) and scalars (/*, j A , and p). This equation can therefore be expressed in any 
coordinate system; in particular, when viewed from the surface stress-energy 
tensor can be expressed in the original coordinates x±. 

3.11.3 Intrinsic formulation 

In Sec. 3.7, the surface stress-energy tensor of a timelike or spacelike shell was 
expressed in terms of intrinsic three-tensors — quantities that can be defined on 
the hypersurface only. The most important ingredients in this formulation were 



80 



Hyp ersurfaces 



h a t„ the (continuous) induced metric, and [K a t], the discontinuity in the extrinsic 
curvature. We would like to achieve something similar here, and remove the need 
to involve an overlapping coordinate system x a to calculate the surface quantities 
H, j A , and p. 

We can expect that the intrinsic description of the surface stress-energy tensor 
of a null shell will involve gab , the nonvanishing components of the induced metric. 
We might also expect that it should involve the jump in the extrinsic curvature of 
the null hypersurface, which would be defined by = k a; p e"e^ = ^(£k9ap) ^" e b- 
Not so. The reason is that there is nothing "transverse" about this object: In the 
case of a timelike or spacelike hypersurface, the normal n a points away from the 
surface, and £ n 9aP truly represents the transverse derivative of the metric; when 
the hypersurface is null, on the other hand, k a is tangent to the surface, and £k9 a p 
is a tangential derivative. Thus, the extrinsic curvature is necessarily continuous 
when the hypersurface is null, and it cannot be related to the tensor 7 a/ 3 defined by 
Eq. (3.11.11). 

There is, fortunately, an easy solution to this problem: we can introduce a 
transverse curvature C f, that properly represents the transverse derivative of the 
metric. This shall be defined by C ab = |(£jv5«/3) e"ef = ^(N a; p + Np ;a )e"e^, or 

C ab = -N a e a a . el (3.11.17) 

To arrive at Eq. (3.11.17) we have used the fact that N a e" is a constant, and 
the identity e".^ = ej ;/3 ef , which states that each basis vector e" is Lie trans- 
ported along any other basis vector; this property ensures that C f,, as defined by 
Eq. (3.11.17), is a symmetric three-tensor. 

In the overlapping coordinates x a , the jump in the transverse curvature is given 

by 

[Cat] = [N a ;p\e a a et 

where we have used Eq. (3.11.12) and the fact that k a is orthogonal to e". We there- 
fore have [C X \] = ^lap^k 13 , [C A \] = ^lape^k 13 = ±-y A , and [Cab] = ^lape^e 1 ^ = 
\lAB, where we have involved Eq. (3.11.15). Finally, we find that the surface 
quantities can be expressed as 

V = -^v AB [C AB ], 3 A = ^v AB [Cxa], p=-±-[C xx ]. (3.11.18) 

We have established that the shell's surface quantities can all be related to the 
induced metric gab and the discontinuity in the transverse curvature C a ft. This 
completes the intrinsic formulation of our null-shell formalism. 



3.11.4 Summary 

A singular null hypersurface S possesses a surface stress-energy tensor characterized 
by tangent vectors k± and e± A , as well as a surface density a surface current j A , 
and a surface pressure p. The surface quantities can all be related to a discontinuity 
in the surface's transverse curvature, 

C ab = -N a e°f } 4, 

which is defined on either side of S in the appropriate coordinate system x±. The 
relations are 
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The surface stress-energy tensor is given by 

S ap = nk a k p + j A (k a e A + e^k?) + pa AB e a A e^ 

and the complete stress-energy tensor of the surface layer is 

Tf= {-k^Y 1 S a ? 5{t). 

In this expression, the factor (— is continuous at the shell, and the vector 
u± = dx±/dT is tangent to an arbitrary congruence of timelike geodesies (which 
represents a family of observers making measurements on the shell). The presence 
of this factor implies that /*, j A , and p are not the physically-measured surface 
quantities; those are instead given by 

^physical = ( — k^U^ /*, ^physical = ( — ^M^) 3^ \ Pphysical = ( — k^U^ p. 

The arbitrariness associated with the choice of congruence is thus limited to a 
single multiplicative factor; the "bare" quantities /*, j A , and p are independent of 
this choice. 

3.11.5 Parameterization of the null generators 

Our null-shell formalism is now complete, and it is ready to be involved in applica- 
tions. We will consider a few in the following subsections, but we first return to a 
statement made earlier, that in general, A cannot be an affine parameter on both 
sides of the hypersurface. We shall justify this here, and also consider what happens 
to ii, j A , and p when the parameterization of the null generators is altered. 

Whether or not A is an affine parameter can be decided by computing k±, the 
"acceleration" of the null vector k±. This is defined on either side of the hypersurface 
by (Sec. 1.3) 

lh -j^l^J Aj ft' , 

and A will be an affine parameter on the y ± side of X if n± = 0. According 
to Eq. (3.11.5), k = —Nak^pk 13 = — N a e".pe^ = C\\, where we have also used 
Eqs. (3.11.2) and (3.11.17). Equation (3.11.18) then relates the discontinuity in the 
acceleration to the surface pressure: 

[ K ] = Snp. (3.11.19) 

We conclude that A can be an affine parameter on both sides of S only when the 
null shell has a vanishing surface pressure. When p ^ 0, A can be chosen to be 
an affine parameter on one side of the hypersurface, but it will not be an affine 
parameter on the other side. 

Additional insight into this matter can be gained from Raychaudhuri's equation, 
which describes the transverse evolution of a congruence of null geodesies (Sec. 2.4). 
In Sec. 2.6, Problem 8, Raychaudhuri's equation was written in terms of an arbi- 
trary parameterization of the null geodesies. When the congruence is hypersurface 
orthogonal, it reads 

dB 1 

— + -6 2 + G ap a a p = k0- %-nT af} k a k p , 

where 6 and u a p are the expansion and shear of the congruence, respectively; the 
equation holds on either side of S. The left-hand side of Raychaudhuri's equation, 
because it depends only on the intrinsic geometry of the hypersurface, is guaranteed 
to be continuous across the shell. Continuity of the right-hand side therefore implies 



[K]8 = 8-ir[T a/3 k a k 13 ]. 



(3.11.20) 
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This relation shows that [k] ^ (and p ^ 0) whenever the component T a fjk a W 
of the stress-energy tensor is discontinuous at the shell. Thus, we conclude that A 
cannot be an affine parameter on both sides of £ when [T a pk a kP] ^ 0. (Notice that 
this conclusion breaks down when 9 = 0, that is, when the shell is stationary.) 

Recalling (Sec. 2.4.8) that the expansion 9 is equal to the fractional rate of change 
of the congruence's cross-sectional area, we find that with the help of Eq. (3.11.19), 
Eq. (3.11.20) can be expressed as 

P^r dS + [T a pk a k p }dS = 0, (3.11.21) 

where dS = y/ad 2 9 is an element of cross-sectional area on the shell (Sec. 3.2.2). 
This equation has a simple interpretation: The first term represents the work done 
by the shell as it expands or contracts, while the second term is the energy absorbed 
by the shell from its surroundings; Eq. (3.11.21) therefore states that all of the 
absorbed energy goes into work. 

Having established that A cannot, in general, be an affine parameter on both 
sides of the hypersurface, let us now investigate how a change of parameterization 
might affect the surface density surface current j A , and surface pressure p of 
the null shell. Because each generator can be reparameterized independently of any 
other generator, we must consider transformations of the form 

X^X(X,9 A ). (3.11.22) 

The question before us is: How do /*, j A , and p change under such a transformation? 

To answer this we need to work out how the transformation of Eq. (3.11.22) 
affects the vectors k a , e% and N a . We first note that the differential form of 
Eq. (3.11.22) is 

d\=e f} d\ + c A d9 A , (3.11.23) 



where 



e -Ui' cA ^wk (3 - 1L24) 

both e 13 and ca depend on y a = (X,9 A ), but because they depend on the intrin- 
sic coordinates only, we have that [e@] = = [ca]- A displacement within the 
hypersurface can then be described either by 

dx a = k a dX + e^d9 A , 

where k a = (dx a /dX)e A and e a A = (dx a /89 A ) x , or by 

dx a = k a d\ + e'\d9 A , 

where k a = (dx a /d\)g A and = (dx a /d9 A )\] these relations hold on either side 
of S, in the relevant coordinate system x±. Using Eq. (3.11.23), it is easy to see 
that the tangent vectors transform as 

k a = e - p k a , e < X = e a A -c A e- p k a (3.11.25) 

under the reparameterization of Eq. (3.11.22). It may be checked that the new basis 
vectors satisfy the orthogonality relations (3.11.2), and that the induced metric gab 
is invariant under this transformation: gab = g a p&A&B = 9a@ e A e B — °ab- To 
preserve the relations (3.11.5) we let the new auxiliary null vector be 

N a =e' 3 N a + l {a AB CAC B )e- f3 k a - (a AB c B )e% (3.11.26) 
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This transformation ensures that the completeness relation (3.11.6) takes the same 
form in the new basis. 

It is a straightforward (but slightly tedious) task to compute how the transverse 
curvature C a \, changes under a reparameterization of the generators, and to then 
compute how the surface quantities transform. You will be asked to go through this 
calculation in Sec. 3.13, Problem 8. The answer is that under the reparameterization 
of Eq. (3.11.22), the surface quantities transform as 

p = e /3 /j, + 2c A j A + {a AB c A c B )e-' 3 p, 
t = j A + {v AB c B )e-ep (3.11.27) 
p = e~@p. 

These transformations, together with Eq. (3.11.25), imply that the surface stress- 
energy tensor becomes S a/3 = e~ /3 S al3 . We also have (— fc^u' 1 ) -1 = e^(— fc M w' i ), 
and these results reveal that the combination (—k l _ l u^)~ 1 S al3 is invariant under 
the reparameterization. This, finally, establishes the invariance of T% p , the full 
stress-energy tensor of the surface layer. 

As a final remark, we note that under the reparameterization of Eq. (3.11.22), 
the physically-measured surface quantities transform as 

/^physical = 6 ^physical + 2,Ca^ J 'physical + ( CT B C A C B )Pphy sicalj 

physical = e /3 j A hysical + (a AB c B ) Pphysicah (3.11.28) 

Pphysical — Pphysical j 

we see in particular that the physically-measured surface pressure is an invariant. 

3.11.6 Imploding spherical shell 

For our first application of the null-shell formalism, we take another look at the 
gravitational collapse of a thin spherical shell, a problem that was first considered 
in Sec. 3.9. Here we imagine that the collapse proceeds at the speed of light, and 
that the thin shell lies on a null hypersurface X. We take spacetime to be flat inside 
the shell (in and write the metric there as 

ds 2 _ = -dt 2 _+dr 2 +r 2 dtt 2 , 

in terms of spatial coordinates (r, 6, <f>) and a time coordinate t_ . The metric outside 
the shell (in f + ) is the Schwarzschild solution, 

ds\ = -fdt\ + f- 1 dr 2 + r 2 dn 2 , 

expressed in the same spatial coordinates but in terms of a distinct time t+; here, 
/ = 1 — 2M/r and M designates the gravitational mass of the collapsing shell. 

As seen from Y~ , the null hypersurface S is described by the equation t_ +r = 
V- = constant, which means that the induced metric on S is given by ds 2 ^ = r 2 d£l 2 . 
As seem from on the other hand, the hypersurface is described by t + +r*(r) = 
v + = constant, where r*(r) = J / _1 dr = r + 2Mln(r/2M — 1), and this gives rise to 
the same induced metric. From these considerations we see that it was permissible 
to express the metrics of in terms of the same spatial coordinates (r, 6, <f>), but 
that t + cannot be equal to f_. The induced metric on the shell is 

(jab d6 A d6 B = X 2 {dd 2 + sin 2 6 d<p 2 ) , 

where we have set 6 A = (6, (j)) and identified — r with the parameter A on the null 
generators of the hypersurface; we shall see that here, A is an affine parameter on 
both sides of S. 
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As seen from , the parametric equations x" = x"(X,6 A ) describing the 
hypersurface have the explicit form t_ = v_ + A, r = —A, 6 = 6, and <j> = <j>. These 
give us the basis vectors k a d a = dt — d r , egd a = d#, and e^d a = 8$, and the basis 
is completed by N a dx a = —\{dt — dr). From all this and Eq. (3.11.17) we find that 
the nonvanishing components of the transverse curvature are 

C AB = ^°AB- 

The fact that C^ x = confirms that A = — r is an affine parameter on the ~f~ side 
ofS. 

As seen from Y + , the parametric equations describing the hypersurface are t+ = 
v+ — r*(— A), r = —A, 6 = 6, and <j> = <j>. The basis vectors are k a 8 a = f~ x 8t — 8 r , 
eg8 a = 8$, e^d a = 0$, and N a dx a = —\{fdt — dr). The nonvanishing components 
of the transverse curvature are now 

C ab = ^, aAB - 

The fact that C\ x = confirms that A = — r is an affine parameter on the f + side 
of S; A is therefore an affine parameter on both sides. 

The angular components of the transverse curvature are discontinuous across 
the shell: [Cab] = — (M /r 2 )uAB- According to Eq. (3.11.18), this means that the 
shell has a vanishing surface current j A and a vanishing surface pressure p, but that 
its surface density is 

M 

V 4 7rr 2 ■ 

We have therefore obtained the very sensible result that the surface density of a 
collapsing null shell is equal to its gravitational mass divided by its (ever decreasing) 
surface area. Notice that /^physical = /* for observers at rest in ~f~ . Because of the 
focusing action of the null shell, however, these observers do not remain at rest 
after crossing over to the ^ + side: A simple calculation, based on Eq. (3.11.8), 
reveals that an observer at rest before crossing the shell will move according to 
dr/dr = —(E 2 — /) 1//2 after crossing the shell; the energy parameter E varies from 
observer to observer, and is related by E = 1 — M/r-z to the radius at which a 
given observer crosses the hypersurface. 



3.11.7 Accreting black hole 

In our second application of the null-shell formalism, we consider a nonrotating 
black hole of mass (M — m) which suddenly acquires additional material of mass m 
and angular momentum J = aM. We suppose that the accretion process is virtually 
instantaneous, that the material falls in with the speed of light, and that J <C M 2 . 
We idealize the accreting material as a singular stress-energy tensor supported on 
a null hypersurface S. 

The spacetime in the future of S — in f + — is that of a slowly rotating black 
hole of mass M and (small) angular momentum aM. We write the metric in y + 
as in Eq. (3.10.1), 

ds 2 = -f dt 2 + f- 1 dr 2 + r 2 dn 2 - — sin 2 6 dtd<f>, 

r 

where / = 1 — 2M/ r ; this is the slow-rotation limit of the Kerr metric, and through- 
out this subsection we will work consistently to first order in the small parameter 
a. 

As seen from y + , the null hypersurface S is described by v = t + r* =0, where 
r* = J f~ x dr = r + 2Mln(r/2M — 1); you may check that in the slow-rotation 
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limit, every surface v = constant is null. It follows that the vector k a = g al3 (—dpv) 
is normal to X and tangent to its null generators. We have 

1 2Ma 
k"d a = - d t - d r + -^r— d<p, 
f r 3 f 

and from this expression we deduce four important properties of the generators. 
First, the generators are affinely parameterized by A = — r. Second, as measured by 
inertial observers at infinity, the generators move with an (ever increasing) angular 
velocity 

d<f> _ 2Ma 

— = ^generators — 3 ■ 

Third, 6 is constant on each generator. And fourth, integration of d<p/{—dr) = 
2Ma/(r 3 f) reveals that 

^ + ^( 1 + 2^ ln/ ) 
also is constant on the generators. 

We shall use y a = (A = —r, 6, ip) as coordinates on S; as we have just seen, these 
coordinates are well adapted to the generators, and this property is required by the 
null-shell formalism. Remembering that dt = —dr/f and d(f> = dtp — (2Ma/r 3 f) dr 
on £, we find that the induced metric is 

a AB d9 A d9 B = r 2 (d6 2 + sin 2 6 dip 2 ), 

and that the hypersurface is intrinsically spherical. 

The parametric description of £, as seen from f + , is x a (—r,6,ip), and from 
this we form the tangent vectors e" = k a , e# = 5g, and = 5%. The basis is 
completed by N a dx a = \{-fdt + dr). From Eq. (3.11.17) we obtain 



, ZMa . 2 „ _ , / 
y xi> = — sm 6 > C ab = 2^ aAB 



for the nonvanishing components of the transverse curvature. 

The spacetime in the past of S — in y~ — is that of a nonrotating black hole 
of mass (M — m). Here we write the metric as 

ds 2 _ = -F dt 2 + F' 1 dr 2 + r 2 (d6 2 + sin 2 9 dip 2 ) , 

in terms of a distinct time coordinate i and the angles 6 and ip; we also have F = 
1 — 2(M — m)/r. This choice of angular coordinates implies that inertial observers 
within f~ corotate with the shell's null generators; this is another manifestation 
of the dragging of inertial frames, a phenomenon first encountered in Sec. 3.10. As 
we shall see presently, this choice of coordinates is dictated by continuity of the 
induced metric at S. 

The mathematical description of the hypersurface, as seen from f~, is identical 
to its external description provided that we make the substitutions t — > i , <p — > ip, 
M — > M — m, and a — > 0. According to this, the induced metric on S is still 
given by ds| = r 2 (d6 2 +sin 2 6 dip 2 ), as required. The basis vectors are now k a d a = 
F -1 <9 f - d r , egd a = de, e^d a = d^, and N a dx a = \{—Fdi + dr). This gives us 

F 

C AB = ^ °AB 

for the nonvanishing components of the transverse curvature. 

The transverse curvature is discontinuous at S, and Eqs. (3.11.18) allow us to 
compute the shell's surface quantities. Because the generators are affinely param- 
eterized by — r on both sides of the shell, we have that p = — the shell has a 
vanishing surface pressure. On the other hand, its surface density is given by 

m 

^ 4-itr 2 ' 
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the ratio of the shell's gravitational mass m to its (ever decreasing) surface area 
4nr 2 . Thus far our results are virtually identical to those obtained in the preceding 
subsection. What is new in this context is the presence of a surface current j A , 
whose sole component is 

3Ma 
J 8irr 4 

This comes from the shell's rotation, and the fact that the situation is not entirely 
spherically symmetric. 

To better understand the physical significance of the surface current, we express 
the shell's surface stress-energy tensor, 

S a ? = pk a k p +f{k a e^ + e%kP), 

in terms of the vector £ a = k a + {j^ In) e^. This vector is null (when we appro- 
priately discard terms of order a 2 in the calculation of g a pt a t^), and it has the 
components 

i a d a = j d t - d r + - fJfl u id<90 

in the coordinates x a = (t, r, 6, <j>) used in y + ; we have set 

_ 2Ma 3Ma 

"fluid = o H 7; J- 

r 6 2mr 

The shell's surface stress-energy tensor is now given by the simple expression 

gap = ^focf?^ 

which corresponds to a pressureless fluid of density n moving with a four-velocity 
l a . We see that the fluid is moving along null curves (not geodesies!) that do not 
coincide with the shell's null generators, and that the motion across generators is 
created by a mismatch between f2fl u id, the fluid's angular velocity, and fi ge nerator S j 
the angular velocity of the generators. The mismatch is directly related to j A : 

n =n -n - 3 - - 3Ma t 

J 'relative — J 'fluid J 'generators — — n J- 

fi 2mr 

Notice that the fluid rotates faster than the generators, which share their angular 
velocity with inertial observers within f~; such a phenomenon was encountered 
before, in the context of the stationary rotating shell of Sec. 3.10. But notice also 
that ^relative decreases to zero as r approaches 2M: the fluid ends up corotating 
with the generators when the shell crosses the black-hole horizon. 



3.11.8 Cosmological phase transition 

In this third (and final) application of the formalism, we consider an intriguing (but 
entirely artificial) cosmological scenario according to which the universe was initially 
expanding in two directions only, but was then made to expand isotropically by a 
sudden explosive event. 

The f~ region of spacetime is the one in which the universe is expanding in the 
x and y directions only. Its metric is 

ds 2 _ = -dt 2 + a 2 (t) (dx 2 +dy 2 ) + dz 2 _, 

and the scale factor is assumed to be given by a(t) oc f 1 / 2 . The cosmological 
fluid moves with a four- velocity u a = dx a /dt, and it has a density and (isotropic) 
pressure given by p- = p- = l/(32nt 2 ), respectively. 
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In the y + region of spacetime, the universe expands uniformly in all three 
directions. Here the metric is 

ds\ = -dt 2 + a 2 (t) (dx 2 + dy 2 + dz\) , 

with the same scale factor a(t) as in ~f~ , and the cosmological fluid has a density 
and pressure given by p + = 3p + = 3/(32-7rf 2 ), respectively; this corresponds to a 
radiation-dominated universe. 

The history of the explosive event that changes the metric from g~p to traces 
a null hypersurface £ in spacetime. This surface moves in the positive z± direction, 
and as we shall see, it supports a singular stress-energy tensor. The "agent" that 
alters the course of the universe's expansion is therefore a null shell. 

As seen from f~, the hypersurface is described by t = Z- + constant, and the 
vector k a d a = dt + d z is tangent to the null generators, which are parameterized 
by t. In fact, because k^k 13 = 0, we have that t is an affine parameter on this 
side of the hypersurface. The coordinates x and y are constant on the generators, 
and we use them, together with t, as intrinsic coordinates on S. We therefore have 
y a = (t,6 A ), 6 A = (x,y), and the shell's induced metric is 

o AB dd A dd B = a 2 {t){dx 2 + dy 2 ). 

The remaining basis vectors are e"d a = d x , e^d a = d y , and N a dx" = — \{dt+dz-). 
The nonvanishing components of the transverse curvature are 

C ab = lft OAB - 

We note that on the V~ side of E, the null generators have an expansion given 
by 6 = k a . a = 1/t, and that T ae ,k a kP = p_ + p_ = l/(167rf 2 ), where T a/3 is the 
stress-energy tensor of the cosmological fluid. 

As seen from f + , the description of the hypersurface is obtained by integrating 
dt = a(t)dz + , and k a d a = dt + a~ x d z is tangent to the null generators. We 
note that t is not an affine parameter on this side of the hypersurface: we have 
that k^pk 13 = (2t)~ 1 k a . The remaining basis vectors are e"d a = d x , e^<9 a = d y , 
N a dx a = —\{dt + adz + ), and the nonvanishing components of the transverse 
curvature are now , 

C « = 2? C+ AB = Xt aAB - 

On this side of S, the generators have an expansion also given by 6 = 1/t (since 
continuity of 6 is implied by continuity of the induced metric), and T a pk a kP = 
P+ +p+ = l/(8nt 2 ). 

The fact that t is an affine parameter on one side of the hypersurface only tells 
us that the shell must possess a surface pressure. In fact, continuity of Cab across 
the shell implies that p is the only nonvanishing surface quantity. It is given by 

1 



the negative sign indicating that this surface quantity would be better described 
as a tension, not a pressure. The shell's surface stress-energy tensor is S al3 = 
pa AB e°^e%. If we select observers comoving with the cosmological fluid as our 
preferred observers to make measurements on the shell, then —k a u a = 1 and the 
full stress-energy tensor of the singular hypersurface is T^P = S a ^5(t — t^,), with 
t-£ denoting the time at which a given observer crosses the shell. We see that for 
these observers, — p is the physically-measured surface tension. 

Finally, we note that the expressions — p = l/(16-7rf), 6 = 1/t, and [T a pk a kP] = 
l/(167rf 2 ) are compatible with the general relation — p6 = [T a pk a kP] derived in 
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Sec. 3.11.5. This shows that the energy released by the shell as it expands is 
absorbed by the cosmological fluid, whose density increases by a factor of p+/p~ = 
3; this energy is provided by the shell's surface tension. 



During the preparation of this chapter I have relied on the following references: 
Barrabes and Israel (1991); Barrabes and Hogan (1998); de la Cruz and Israel 
(1968); Israel (1966); Misner, Thorne, and Wheeler (1973); Musgrave and Lake 
(1997); and Wald (1984). 
More specifically: 

Sections 3.1, 3.2, and 3.3 are based partially on unpublished lecture notes by 
Werner Israel. Sections 3.4, 3.5, and 3.9 are based on Israel's paper. Section 3.6 
is based on Sec. 10.2 of Wald. Sections 3.7 and 3.11 (as well as Problem 9 below) 
are based on Barrabes and Israel. Section 3.8 is based on Exercise 32.4 of Misner, 
Thorne, and Wheeler. Section 3.10 is based on de la Cruz and Israel. Finally, 
the examples of Sees. 3.11.7 and 3.11.8 are adapted from Musgrave & Lake, and 
Barrabes & Hogan, respectively. 



Warning: The results derived in Problem 1 are used in later portions of this book. 
1. We consider a hypersurface T = constant in Schwarzschild spacetime, where 



We use (r, 6, <j>) as coordinates on the hypersurface. 

a) Calculate the unit normal n a , and find the parametric equations describing 

the hypersurface. 

b) Calculate the induced metric h^. 

c) Calculate the extrinsic curvature K^. Verify that your results agree with 

those of Sec. 3.6.6, and show that K is equal to the expansion of the 
geodesic congruence considered in Sec. 2.3.7. 

d) Prove that when it is expressed in terms of the coordinates (T, r, 6, <f>), the 

Schwarzschild metric takes the form 



This shows very clearly that the sections T = constant are intrinsically 
flat. [This coordinate system was discovered independently by Painleve 
(1921) and Gullstrand (1922).] 

2. A four-dimensional hypersurface is embedded in a flat, five-dimensional space- 
time. We use coordinates z A in the five-dimensional world, and express the 
metric as 



we let uppercase latin indices run from to 4. In the four-dimensional world 
we use coordinates x a = (t, x, 6, <t>)- The hypersurface is defined by parametric 
relations z A (x a ). Explicitly, 
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ds 2 = -dT 2 + (dr + ^2M/rdT) 2 + r 2 dti 2 . 



ds 2 = t]ab dz A dz B = -(dz ) 2 + (dz 1 ) 2 + (dz 2 ) 2 + (dz 3 ) 2 + (dz 4 ) 2 ; 



z = a sinh(f/a) 



z 1 = a cosh(t/a) cosx, 



z 2 = a cosh(t/a) sin \ cos 6, 
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z 3 = a cosh(i/a) sin x sin 6 cos <f>, z 4 = a cosh(i/a) sin x sin 9 sin <j>, 
where a is a constant. 

a) Compute the unit normal n A and the tangent vectors e A = dz A /dx a to 

the hypersurface. 

b) Compute the induced metric g a p. What is the physical significance of this 

four-dimensional metric? Does it satisfy the Einstein field equations? 

c) Compute the extrinsic curvature K a p. Use the Gauss-Codazzi equations 

to prove that the induced Riemann tensor can be expressed as 

Ra/3-yd = {da-jdpS ~ 9ad9P"/) ■ 

This implies that the four-dimensional hypersurface is a spacetime of 
constant Ricci curvature. 

3. In this problem we consider a spherically symmetric space at a moment of 
time symmetry. We write the three-metric as 

ds 2 =d£ 2 +r 2 (£) dtf, 

where £ is proper distance from the centre. 

a) Show that in these coordinates, the mass function introduced in Sec. 3.6.5 

is given by 

m(r) = r -[l-(dr/dl) 2 ]. 

b) Solve the constraint equations for a uniform energy density p on the hy- 

persurface. Make sure to enforce the asymptotic condition r(£ — > 0) — > £, 
so that the three-metric is regular at the centre. 

c) Prove that r(£) can be no larger than r max = >/3/87r/9. 

d) Prove that 2m(r max ) = r max , and that m(r max ) is the maximum value of 

the mass function. 

e) What happens when £ — » 7rr max ? 

4. Prove the statement made toward the end of Sec. 3.7.5, that [K a t] = is 
a sufficient condition for the regularity of the full Riemann tensor at the 
hypersurface S. 

5. Prove that the surface stress-energy tensor of a thin shell satisfies the conser- 
vation equation 

s a | 6 = -4i a L 

where j a = T a pe"n' 3 . Interpret this equation physically. (Consider the case 
where the shell is timelike.) 

6. The metric 

ds 2 = -dt 2 +d£ 2 +r 2 (£)dn 2 , 

where r(£) = £ when < £ < £ and r(£) = 2£ - £ when £ < £ < 2£ , 
describes a spacetime with closed spatial sections. (What is the volume of a 
hypersurface t = constant?) The spacetime is flat in both f~ (£ < £ ) and 
y + (£ > £o), but it contains a surface layer at £ = £ . 

a) Calculate the surface stress-energy tensor of the thin shell. Express this 
in terms of a velocity field u a , a density a, and a surface pressure p. 
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b) Consider a congruence of outgoing null geodesies in this spacetime, with 

its tangent vector k a = —d a (t — I). Calculate 6, the expansion of this 
congruence. Show that it abruptly changes sign (from positive to nega- 
tive) at I = l . The surface layer therefore produces a strong focusing of 
the null geodesies. 

c) Use Raychaudhuri's equation to prove that the discontinuity in dQjdl is 

precisely accounted for by the surface stress-energy tensor. 

7. Two Schwarzschild solutions, one with mass parameter ra_, the other with 
mass parameter m+, are joined at a radius r = R(t) by means of a spherical, 
massive thin shell. Here, r denotes proper time for an observer comoving 
with the shell. It is assumed that m_ is the interior mass (m+ is the exterior 
mass), that ro+ > ra_, and that R(t) > 2m+ for all values of r. The shell's 
surface stress-energy tensor is given by 

S ab = (a+ p)u a u b + ph ab , 

where u a = dy a /dr is the fluid's velocity field, ct(t) the surface density, p(r) 
the surface pressure, and /i f, the induced metric. 

a) Derive, and interpret physically, the equation 

*(rf,-0. 

b) Find the values of a and p which allow for a static configuration: R(t) = 

Ro = constant. Verify that both a and p are positive. [The stability of 
these static configurations was examined by Brady, Louko, and Poisson 
(1991).] 

8. Derive the relations (3.11.26). 

9. Let spacetime be partitioned into two regions with metrics 

ds\ = -f± dv 2 + 2 dvdr + r 2 dtt 2 . 

We assume that the coordinate system (v,r,6,<f>) is common to both f~ 
and f + . (In each region we could introduce a conventional time coordinate 
t± defined by dt± = dv — dr / f±, but it is much more convenient to work 
with the original system.) In f~ we set /_ = 1 — ro/r, so that the metric 
is a Schwarzschild solution with mass parameter M = \tq. In ~f + we set 
/+ = 1 — (r/ro) 2 , so that the metric is a de Sitter solution with cosmological 
constant A = 3/ro 2 . (This metric is a solution to the modified Einstein field 
equations, G a p + Ag a/ 3 = 0.) The boundary S between the two regions is the 
null surface r = ro, the common horizon of the Schwarzschild and de Sitter 
spacetimes. 

Using y a = (v,6,<f>) as coordinates on S, calculate the surface quantities ft, 
j A , and p associated with the null shell. Explain whether your results are 
compatible with the general relation p8 = [T a pk a k^] derived in Sec. 3.11.5. 



Chapter 4 
Lagrangian and 
Hamiltonian formulations 
of general relativity 



Variational principles play a central role in virtually all areas of physics, and general 
relativity is no exception. This chapter is devoted to a general discussion of the 
Lagrangian and Hamiltonian formulations of field theories in curved spacetime, 
with a special focus on general relativity. 

The Lagrangian formulation of a field theory (Sec. 4.1) begins with the introduc- 
tion of an action functional, which is usually defined as an integral of a Lagrangian 
density over a finite region f of spacetime. As we shall see, general relativity 
is peculiar in this respect, as its action involves also an integration over df ', the 
boundary of the region f; this is necessary for the well-posedness of the variational 
principle. We will, in this chapter, provide a systematic treatment of the boundary 
terms in the gravitational action. 

The Hamiltonian formulation of a field theory (Sec. 4.2) involves a decomposi- 
tion of spacetime into space and time. Geometrically, this corresponds to a foliation 
of spacetime into nonintersecting spacelike hypersurfaces S. In this 3 + 1 decom- 
position, the spacetime metric g a p is decomposed into an induced metric hat, a 
shift vector N a , and a lapse scalar TV; while the induced metric is concerned with 
displacements within S, the lapse and shift are concerned with displacements away 
from the hypersurface. The Hamiltonian is a functional of the field configuration 
and its conjugate momentum on S. In general relativity, the Hamiltonian is a 
functional of /i f, and its conjugate momentum p ab , which is closely related to the 
extrinsic curvature of the hypersurface £; the lapse and shift are freely specifiable, 
and they do not appear in the Hamiltonian as dynamical variables. The gravita- 
tional Hamiltonian inherits boundary terms from the action functional; those are 
defined on the two-surface S formed by the intersection of df and S. 

There is a close connection between the gravitational Hamiltonian and the to- 
tal mass M and angular momentum J of an asymptotically-flat spacetime; this 
connection is explored in Sec. 4.3. We will see that the value of the gravitational 
Hamiltonian for a solution to the Einstein field equations depends only on the con- 
ditions at the two-dimensional boundary S. When the spacetime is asymptotically 
flat and S is pushed to infinity, the Hamiltonian becomes M if the lapse and shift 
are chosen so as to correspond to an asymptotic time translation. For an alternative 
choice of lapse and shift, corresponding to an asymptotic rotation about an axis, 
the Hamiltonian becomes J, the component of the angular-momentum vector along 
this axis. These Hamiltonian definitions for mass and angular momentum form the 
starting point, in Sec. 4.3, of a rather broad review of the different notions of mass 
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and angular momentum in general relativity. 



4.1 Lagrangian formulation 

4-1.1 Mechanics 

In the Lagrangian formulation of Newtonian mechanics, one is given a Lagrangian 
<?)> a function of the generalized coordinate q and its velocity q = dq/dt. One 
then forms an action functional S[q], 



S[q] = t L(q, 
Jti 



q)dt, (4.1.1) 



by integrating the Lagrangian over a selected path q(t). The path that satisfies the 
equations of motion is the one about which S[q] is stationary: Under a variation 
Sq(t) of this path, restricted by 

5q(h) =5q(t 2 ) =0 (4.1.2) 

but otherwise arbitrary in the interval t\ < t < t 2 , the action does not vary, SS = 0. 
The change in the action is given by 



5S = [ 61. dt 

f 



(dL . dL \ 

«2 r t 2 



dL . \ 12 f t2 fdL ddL\ £J 



where, in the last step, we have used Sq = d(6q)/dt and integrated by parts. The 
boundary terms vanish by virtue of Eq. (4.1.2). Because the variation is arbitrary 
between t\ and t 2 , 

This is the Euler- Lagrange equation for a one-dimensional mechanical system. Gen- 
eralization to higher dimensions is immediate. 

4.1.2 Field theory 

We now consider the dynamics of a field q{x a ) in curved spacetime. Although this 
field could be of any type (scalar, vector, tensor, spinor), for simplicity we shall 
restrict our attention to the case of a scalar field. 

In the Lagrangian formulation of a field theory, one is given an arbitrary region 
f of the spacetime manifold, bounded by a closed hypersurface df. One is also 
given a Lagrangian density J?(q,q, a ), a scalar function of the field and its first 
derivatives. The action functional is then 

S[q]= [ J?(q,q, a )V=9d 4 x. (4.1.4) 
Jy 

Dynamical equations for q are obtained by introducing a variation 6q(x a ) that is 
arbitrary within f but vanishes everywhere on df, 

Sq\ gr =0, (4.1.5) 
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and by demanding that 6S vanish if the variation is about the actual path q(x a ). 
Equation (4.1.5) is the field-theoretical counterpart to Eq. (4.1.2). 

Upon such a variation (we use the notation if' = <9if/<9(/, if" = Sif/Sq^), 



6S 



/[ 

Jy 



= [ (£"5q + ^ a 5q, a )yf=g~d A x 
Jy 

[if' 5q + (J? a Sq) ;a - J? a Jq] d A X 

= [ (if' - if? a )<W^d 4 £ + / Jf a SqdS a , 
Jy Jey 

where Gauss' theorem (Sec. 3.3) was used in the last step. The surface integral 
vanishes by virtue of Eq. (4.1.5), and because Sq is arbitrary within f, we obtain 

<9if <9if 

6S = => v a £L-?fL = o. (4.1.6) 
dq, a dq 

This is the Euler-Lagrange equation for a single scalar field q. Generalization to a 
collection of fields is immediate, and the procedure can be taken over to fields of 
arbitrary tensorial or spinorial types. 

As a concrete example, let us consider a Klein-Gordon field ip with Lagrangian 
density 

We have if" = -0 a/3 V,/3, = -9 al3 ip;ap, and if' = -ro 2 V>- The Euler-Lagrange 
equation becomes 

9 aP taP - m 2 ip = o, 
which is the curved-spacetime version of the Klein-Gordon equation. 



4-1-3 General relativity 

The action functional for general relativity contains a contribution So[g] from the 
gravitational field g a p and a contribution Sm [<t>] d] from the matter fields, which we 
collectively denote </>. 

The gravitational action contains a Hilbert term I H [g\/{l&-K), a boundary term 
7b[<?]/(167t), and a nondynamical term 7 /(167r) that affects the numerical value of 
the action but not the equations of motion. More explicitly, 

So [g] = ^ (i H [g] + Ib [g] - /„) , (4.1.7) 

where 

i H [g] = I RV=gd 4 x, (4.1.8) 

Jy 

I B [g] = 2<f eK\h\^ 2 dPy, (4.1.9) 
Jay 

J = 2 I eK \h\ 1 / 2 d 3 y. (4.1.10) 
Jay 

Here, R is the Ricci scalar in f, K is the trace of the extrinsic curvature of df, e 
is equal to +1 where d"V is timelike and —1 where d"V is spacelike (it is assumed 
that df is nowhere null), and h is the determinant of the induced metric on df. 
Coordinates x a are used in Y , and coordinates y a are used on df . The factor of 
(167r) _1 on the right-hand side of Eq. (4.1.7) will be seen to give rise to the factor 
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of 8ir in the Einstein field equations. The role of Ib [g] in the variational principle 
will be elucidated below. The presence of I i n the action will also be explained, 
and this explanation will come with a precise definition for the quantity K . 
The matter action is taken to be of the form 

S M [4>;g}= [ ^{.4>A^9aa)^F9d A x, (4.1.11) 

for some Lagrangian density _£f . As Eq. (4.1.11) indicates, it is assumed that only 
g a p, and none of its derivatives, appears in the matter action. This assumption is 
made for simplicity, and it could easily be removed. 
The complete action functional is therefore 

%; 4>] = J (J^ + V^g^x + -L j e(K - JC )|/i| 1/2 d 3 y. (4.1.12) 

The Einstein field equations, G a p = 8nT a p, are recovered by varying S[g,<fi] with 
respect to g a p. The variation is subjected to the condition 

6g aP \ gr =0. (4.1.13) 

This implies that /i f, = <? a ,ge"ej, the induced metric on df, is held fixed during 
the variation. 

4-1-4 Variation of the Hilbert term 

It is convenient to use the variations Sg" 13 instead of 5g a p. These are of course not 
independent: the relations g^g^p = S"^ imply 

Sg a p = -gowgpvW- (4.1.14) 

We recall (from Sec. 1.7) that the variation of the metric determinant is given by 
<51n \g\ = g al3 5g al 3 = -g a pSg al3 , which implies 

sV=g = -lv=gg a pSg aP - (4.1.15) 

We also recall (from Sec. 1.2) that although r a ^ is not a tensor, the difference 
between two sets of Christoffel symbols is a tensor; the variation ST"^ is therefore 
a tensor. 

We now proceed with the variation of the Hilbert term in the gravitational 
action: 

SIh = [ S(g a PR aP V=g)d 4 x 

= I (R a pV=gtg aP + g al3 V=gSR a p + R6V=g)d 4 x 

= J (R aP - l - Rg a ^j dg a0 V^gd 4 x + J g aP 6R a pV=gd 4 x. 

In the last step we have used Eq. (4.1.15). The first integral seems to give us what 
we need for the left-hand side of the Einstein field equations, but we must still 
account for the second integral. 

Let us work on this integral. We begin with SR a p, which we calculate in a local 
Lorentz frame at a point P: 

5R aP 4 5(rv,-r^) 
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Here, covariant differentiation is denned with respect to the reference metric g a p, 
about which the variation is taken. We notice that the last expression is tensorial; 
it is therefore valid in any coordinate system. We have found 

g^5R aP = dv^, dv» = g af, 5r» afl - g^ST^. (4.1.16) 

We use the "slash" notation fiv^ 1 to emphasize the fact that fiv^ 1 is not the variation 
of some quantity v^ 1 . Using Eq. (4.1.16), the second integral in 5Ih becomes 

Jar 

= I e3u"n„|A| 1/2 d 3 y, 
Jar 

where n M is the unit normal to df and e = n^n^ = ±1. 

We must now evaluate fiv^ri^, keeping in mind that on df, 5g a p = = Sg al3 . 
Under these conditions, 

STl * a p\d-r = \ a >1V {&9va,l3 +&9vl3,a -5g a p >v ), 

and substituting this into Eq. (4.1.16) yields fiv^ = <? a/3 (5<? Mj g, a — Sg a p^), so that 

n"3u„| 8y = n^ l {en a n 13 + h a/3 )(dg^, a - 6g a/3 ^) 

= n»h al3 (5g^, a - 5g a p tfl ). 

In the first line we have substituted the completeness relations g al3 = en a n' 3 + h al3 , 
where h a P = h ab e"e^ (see Sec. 3.1). To obtain the second line we have multiplied 
n a n^ by the antisymmetric quantity within the brackets. Now, because 6g a p van- 
ishes everywhere on df ', its tangential derivatives must vanish also: 6g a p n e2 = 0. 
It follows that h^^dg^p^a = 0, and we finally obtain 

n"^U = -h^Sg^n". (4.1.17) 

This is nonzero because the normal derivative of 6g a p is not required to vanish on 
the hypersurface. 

Gathering the results, we obtain 

SIh= [ G aP 6g^^d 4 x- I eh aP 5g a ^\h\ 1 l 2 d 3 y. (4.1.18) 
Jy J at 

The boundary term in Eq. (4.1.18) will be canceled by the variation of /b[s]: this 
is the reason for including a boundary term in the gravitational action. That a 
boundary term is needed is due to the fact that R, the gravitational Lagrangian 
density, contains second derivatives of the metric tensor, a nontypical feature of field 
theories. 



4-1-5 Variation of the boundary term 

We now work on the variation of /b[s], as given by Eq. (4.1.9). Because the induced 
metric is fixed on df, the only quantity to be varied is K, the trace of the extrinsic 
curvature. We recall from Sec. 3.4 that 

K = n% 

= {en a n p + h al3 )n a .p 
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so that its variation is 

5K = -/i a/3 5r 7 a/3 n 7 

= -^/i a ^(%«,/3 +%/3,« -5Sa/3, M )ri' J 

= \h a Hg afi ^; 

we have used the fact that the tangential derivatives of 5g a p vanish on df. We 
have obtained 

8I B =(f eh^Sg^n^h^ 2 d 3 y, (4.1.19) 
Joy 

and we see that this indeed cancels out the second integral on the right-hand side 
of Eq. (4.1.18). Because SIo = 0, the complete variation of the gravitational action 
is 

dS G = ^T I G aP 5g^^d A x. (4.1.20) 
This produces the correct left-hand side to the Einstein field equations. 

4-1.6 Variation of the matter action 
Variation of SWI^; <?], as given by Eq. (4.1.11), yields 

SS M = [ 5{^sf=g)d A x 
Jy 



-a 

Jy ( 



05? $g aP V=9 + se d^=g ) d 4 x 



dg 

r) V 1 

- --^g a/3 )Sg^V^9d i x. 



then 



If we define the stress-energy tensor by 

T a0 = -2-^+^g a0 , (4.1.21) 

5Sm = ~\ ^T aP 6g al3 V=9~d 4 x, (4.1.22) 

and this produces the correct right-hand side to the Einstein field equations. 
We have obtained 

S(S G +S M )=0 => G a/3 = 8nT a/3 , (4.1.23) 

because the variation 5g al3 is arbitrary within f . The Einstein field equations 
therefore follow from a variational principle, and the action functional for the theory 
is given by Eq. (4.1.12). 

To see that Eq. (4.1.21) gives a reasonable definition for the stress-energy tensor, 
let us consider once more a Klein-Gordon field tp with Lagrangian density 

^ = -^(w, + ™¥)' 

It is easy to check that for this, Eq. (4.1.21) becomes 

T a p = ip, a ip,f) - ^ (i>'"ip,n + m 2 tp 2 ^g a p. 

This is the correct expression for the Klein-Gordon stress-energy tensor. You may 
look into the consistency of this result by checking that the statement of energy- 
momentum conservation, T a/3 ;/3 = 0, implies the Klein-Gordon equation. 
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4-1.7 Nondynamical term 



What is the role of 



2/ sK \h\^ 2 d 3 



y 



in the gravitational action? Because Jo depends only on the induced metric h a i, 
(through the factor |/i| 1//2 in the integrand), its variation with respect to g a p gives 
zero, and the presence of Iq cannot affect the equations of motion. Its purpose can 
only be to change the numerical value of the gravitational action. 

Let us first assume that g a p is a solution to the vacuum field equations. Then 
R = and the numerical value of the gravitational action is 



where we omit the subtraction term K for the time being. Let us evaluate this 
for flat spacetime. We choose df to consist of two hypersurfaces t = constant 
and a large three-cylinder at r = R (Fig. 4.1). It is easy to check that K = on 
the hypersurfaces of constant time. On the three-cylinder, the induced metric is 
ds 2 = -dt 2 + R 2 d£l 2 , so that ^l 2 = R 2 sin0. The unit normal is n a = d a r, so 
that e = 1 and K = n a . a = 2/R. We then have 



and this diverges when R — >• 00, that is, when the spatial boundary is pushed all 
the way to infinity. The gravitational action of flat spacetime is therefore infinite, 
even when f is bounded by two hypersurfaces of constant time. Because this 
problem does not go away when the spacetime is curved, this would imply that the 
gravitational action is not a well-defined quantity for asymptotically-flat spacetimes. 
(Of course, this is not a problem if the spacetime manifold is compact.) 

This problem is remedied by Iq. Apart from a factor of 16n, this term is chosen to 
be equal to the gravitational action of flat spacetime, as regularized by the procedure 
used above. The difference Ib — lo is then well defined in the limit R — > 00, and 
there is no longer a difficulty in defining a gravitational action for asymptotically- 
flat spacetimes. (The subtraction term is irrelevant for compact manifolds.) In 
other words, the choice 

K = extrinsic curvature of df embedded in flat spacetime (4.1.24) 

cures the divergence of the gravitational action, which is then well defined when the 
spacetime is asymptotically flat. In particular, Sq = for flat spacetime. 






r = R 




Figure 4.1: The boundary of a region f of flat spacetime. 
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4-1.8 Bianchi identities 

The Lagrangian formulation of general relativity provides us with an elegant deriva- 
tion of the contracted Bianchi identities, 

G ap . p = 0. (4.1.25) 

In this approach, Eq. (4.1.25) comes as a consequence of the invariance of So[g] 
under a change of coordinates in f . 

To prove this, it is sufficient to consider infinitesimal transformations, 

i a 4i'"=i"le°, (4.1.26) 

where e a is an infinitesimal vector field, arbitrary within f but constrained to 
vanish on df. The variation of the metric under such a transformation is 

= g'a/3 (z' ) - g a /3 (x) + g' af} (x) -g' a p (x' ) 

= (K - ^, a Wp - z",p)g^(x) - g aP {x) - g a pAx)^ 
= -^, a gnP - e %g<*n - gap,^ 

discarding all terms of the second order in e a . Using Eq. (4.1.14), we find that the 
metric variation is 

<5 5 a/3 = e a;/3 +e /3;a . (4.1.27) 
Substituting this into Eq. (4.1.20), we find 

8nSS G = [ G a/3 e a . t/3 y/=g~d 4 x 
Jy 

= ~f G ap .gt a V=g~d 4 x + <f> G af} t a dY>p. 

With e a arbitrary within f but vanishing on df, the contracted Bianchi identities 
follow from the requirement that SSg = under the variation of Eq. (4.1.27). 



4.2 Hamiltonian formulation 

4-2.1 Mechanics 

The Hamiltonian formulation of Newtonian mechanics begins with the introduction 
of the canonical momentum p, defined by 

>-%■ ^ 

It is assumed that this relation can be inverted to give q as a function of p and q. 
The Hamiltonian is then 

H(jp,q)=pq-L. (4.2.2) 

Hamilton's form of the equations of motion can be derived from a variational prin- 
ciple. Here, the action is varied with respect to p and q independently, with the 
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restriction that 6q must vanish at the endpoints. Thus, 
6S = f's(pq-H)dt 



jC(' 



* 2 / OH OH 

p8q + q8p — Sp — Sq I dt 



dp dq 

ta r t 2 ~ 



Because the variation is arbitrary between ti and £2, but = Sqfa) = 0, we 

have 

,5 = * 4 -«. ,4,,, 

These are Hamilton's equations. 

4-2.2 3 + 1 decomposition 

The Hamiltonian formulation of a field theory is more involved. Here, the Hamil- 
tonian H\p, q] is a functional of g, the field configuration, and p, the canonical 
momentum, on a spacelike hypersurface S. To express the action in terms of the 
Hamiltonian, it is necessary to foliate f into a family of spacelike hypersurfaces, 
one for each "instant of time" . This is the purpose of the 3 + 1 decomposition. 

To effect this decomposition, we introduce a scalar field t(x a ) such that t = 
constant describes a family of nonintersecting spacelike hypersurfaces T, t . This 
"time function" is completely arbitrary; the only requirements are that t be a single- 
valued function of x a , and n a oc d a t, the unit normal to the hypersurfaces, be a 
future-directed timelike vector field. 

On each of the hypersurfaces T, t we install coordinates y a . A priori, the coordi- 
nates on one hypersurface need not be related to the coordinates on another hyper- 
surface. It is, however, convenient to introduce a relationship, as follows (Fig. 4.2). 
Consider a congruence of curves 7 intersecting the hypersurfaces T, t . We do not 
assume that these curves are geodesies, nor that they intersect the hypersurfaces 
orthogonally. We use t as a parameter on the curves, and the vector t a = dx a /dt 
is tangent to the curves. It is easy to check that the relation 

t a d a t = 1 (4.2.4) 

follows from the construction. A particular curve 7p from the congruence defines 
a mapping from a point P on St to a point P' on T,f , and then to a point P" on 
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St» , and so on. To fix the coordinates of P' and P", given y a (P) on S t , we simply 
demand y a (P") = y a (P') = y a (P)- Thus, y a is held constant on each of the curves 
7- 

This construction defines a coordinate system (t, y a ) in f . There exists a trans- 
formation between this and the system x a originally in use: x a = x a (t,y a ). We 
have 

'dx a " 



and we define 

m 

to be tangent vectors on S t . These relations imply that in the coordinates (t,y a ), 
t a = S a t and e£ = S a a . We also have 

£ t e« = 0, (4.2.7) 

which holds in any coordinate system. 

We now introduce the unit normal to the hypersurfaces: 

n a = -Nd a t, n a e a a = 0, (4.2.8) 

where the scalar function N, called the lapse, ensures that n a is properly normalized. 
Because the curves 7 do not generally intersect St orthogonally, t a is not parallel to 
n a . We may decompose t a in the basis provided by the normal and tangent vectors 
(Fig. 4.3): 

t a = Nn a + N a e%; (4.2.9) 

the three-vector N a is called the shift. It is easy to check that Eq. (4.2.9) is com- 
patible with Eq. (4.2.4). 

We can use the coordinate transformation x a = x a (t, y a ) to express the metric 
in the coordinates (t,y a ). We start by writing 

dx a = t a dt + e a a dy a 

= (N dt)n a + (dy a + N a dt)e%, 

which follows at once from Eqs. (4.2.5), (4.2.6), and (4.2.9). The line element is 
then given by ds 2 = g a pdx a dx l3 , or 

ds 2 = -N 2 dt 2 + h ab (dy a + N a dt)(dy b + N b dt), (4.2.10) 




Figure 4.3: Decomposition of t a into lapse and shift. 



4.2 Hamiltonian formulation 



101 



where h a b = g a p e"e^ is the induced metric on £ t . 

We may now express the metric determinant g in terms of h = det[h a b] and 
the lapse function. We recall that g u = cofactor (<?«)/<? = h/g, as follows from 
Eq. (4.2.10). But g u = g a/3 t ia t i/3 = N- 2 g aP n a nf} = -N~ 2 , where Eq. (4.2.8) was 
used. The desired expression is therefore 

y/=g~ = NVh. (4.2.11) 

Equations (4.2.9), (4.2.10), and (4.2.11) are the fundamental results of the 3 + 1 
decomposition. 



4.2.3 Field theory 

We now return to the Hamiltonian formulation of a field theory. For simplicity, we 
will assume that the field is a scalar, but the procedure can easily be extended to 
fields of other tensorial types. We begin by defining the "time derivative" of q to 
be its Lie derivative along the flow vector t a , 

q = £ t q- (4.2.12) 

In the coordinates (t,y a ), £tq = dq/dt, and q reduces to the ordinary time deriva- 
tive. We also introduce the spatial derivatives, q, a = q, a e"- The field's Lagrangian 
density can then be expressed as J£(q,q,q, a )- 

The field's canonical momentum p is defined by 

It is assumed that this relation can be inverted to give q in terms of q, q, a , and p. 
The Hamiltonian density is then 

^(p,q,q,a)=pq-^^. (4.2.14) 

Because of the factors of \Z— S m Eqs. (4.2.13) and (4.2.14), the Hamiltonian density 
is not a scalar with respect to transformations y a — > y a . We might introduce a 
scalarized version J^ ca i a r, defined by J(f = VhJ$f sca \ ai = y/—g J%lcai&i/N, but such 
an object would turn out not to be as useful as the original, nonscalar, Hamiltonian 
density. The Hamiltonian functional is defined by 

H\p,q]= [ Je(p,q,q, a )d 3 y. (4.2.15) 

The Hamiltonian functional is an ordinary (nonscalar) function of time t. 

We consider a region f of spacetime foliated by spacelike hypersurfaces E( 
bounded by closed two-surfaces St (Fig. 4.4); f itself is bounded by the hypersur- 
faces Sjj, S( a , and 38, the union of all two-surfaces St- To obtain the Hamilton 
form of the field equations, we will vary the action with respect to q and p, treating 
the variations Sq and Sp as independent. We will demand that Sq vanish on the 
boundaries T, tl , £( 2 , and 38. 

The action functional is given by 



S= dt f (pq-Jff)d 3 y, 



and variation yields 



SS = j dt j [p8q + q5p- -^-5p- -^-Sq- -^-5q ia )d d y. 



102 



Lagrangian and Hamiltonian formulations of general relativity 




Figure 4.4: The region V, its boundary df, and their foliations. 



The first term may be integrated by parts: 

rta r 



f dt ( p5qd 3 y = f dt^j- f pdqd 3 y - f dt f p5qd 3 y 

= / pSqd 3 y— / p8qd 3 y— / dt pSqd 3 y 

J^t<2 J^ti Jtl JUt 

= ~ [ dt f p6qd 3 y, 
because Sq = on S tl and T, t2 . We treat the last term similarly: 

-f'dtf ^6q, a d 3 y = -tdtf ^§^ S q, a Vhd 3 y 

Jti Je, Oq,a Jti Je, °Q,a 



- [ dtl 

Jtl JS 

it! l-Et ^ 

r*j 

Jti JY. 



scalar 



5q dS a 



S t d( l.a 

( ^ ^scalar \ 

8qd 3 y. 



6qVhd 3 



In the second line we have used the three-dimensional version of Gauss' theorem, 
with dS a denoting the surface element on St- In the third line we have used the 
divergence formula A a , = h~ 1 / 2 (h 1 / 2 A a ) ta and the fact that Sq vanishes on St- 
Gathering the results, we have 



SS 



it! l t 



P + 



dq 



Ha ) a 



5q + 



and 



5S = 



p=- 



1 = 



dp 
dp 



dp } d 3 y, 



(4.2.16) 



dq \dq, a J ta ' 

These are Hamilton's equations for a scalar field q and its canonical momentum p. 

As a concrete example, we consider once again a Klein-Gordon field tp with its 
Lagrangian density 

For simplicity, we choose our foliation to be such that N a = 0. This implies g tt = 
l/9tt, 9 ta = 0, and g ab = h ab . Then jgf = -\{g tt ^ 2 + h ab ip ta ip fi + m 2 ip 2 ), p = 
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-\f zr gg ip, and Eq. (4.2.14) gives 



3V = - *L a + l -V=g~ {h ab ^, a ^, b + mV) ■ 
^y—gg 2 

The equations of motion are 

It is easy to check that from these follow the Klein-Gordon equation, g al3 ip- a p — 
m 2 ip = 0, in the selected foliation. 

4-2-4 Foliation of the boundary 

Before tackling the case of the gravitational field, we need to provide additional 
details regarding the foliation of 88, the timelike boundary of f , by the two-surfaces 
St- (Refer back to Fig. 4.4.) 

The closed two-surface St is the boundary of the spacelike hypersurface T, t , on 
which we have coordinates y a , tangent vectors e", and an induced metric /i &. It is 
described by an equation of the form $(y°) = 0, or by parametric relations y a (6 A ), 
where 8 A are coordinates on St- We use r to denote the unit normal to St, and we 
define an associated four-vector r a by 

r a = r a e a a . (4.2.17) 

This satisfies the relations r a r a = 1 and r a n a = 0, where n a is the normal to T, t . 
The three-vectors e a A = dy a /d6 A are tangent to St, so that r a e a A = 0. This implies 
r « e S = 0; where 

Br a 

e a A ^e" a e A = ^ J . (4.2.18) 

In this equation, it is understood that x a stands for the functions x a (y a (6 A )), where 
x a (y a ) are the parametric relations defining Sj. 
The induced metric on St is given by 

ds 2 =<JABde A dd B , (4.2.19) 

where o A b = K b e a A e b B = (g a p e£ef )e a A e b B , or, using Eq. (4.2.18), 

(tab = g a pe a A e B . (4.2.20) 

Its inverse is denoted a AB . The three-dimensional completeness relations, h ab = 
r a r b + a AB e A e b B , are easily established (see Sec. 3.1). It follows that the four- 
dimensional relations, g al3 = —n a nP + h ab e"e^, can be expressed as 

g aP = _ n a n f3 + r a r /3 + AB (4.2.21) 



This can be established directly by computing all inner products between the vectors 
n a , r a , and e A . 

The extrinsic curvature of St embedded in S t is defined by Uab = r a \t,e A e b B (see 
Sec. 3.4), or 

k A B=r a #e a A e B . (4.2.22) 

We use k to denote its trace: k = o AB kAB- 

A priori, the coordinates 6 A on a given two-surface (Sf , say) are not related to 
the coordinates on another two-surface (Sf, say). To introduce a relationship, we 
consider a congruence of curves j3 running on &$, intersecting the two-surfaces St 
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orthogonally, and therefore having n a as their tangent vectors. We demand that if 
the curve /3p intersects Sp at the point P' labelled by 6 A , then the same coordinates 
will designate the point P" at which fl P intersects St» ■ Because 6 A does not vary 
along these curves, and because t can be chosen as a parameter, we have 



where the factor iV _1 comes from Eq. (4.2.8) and the normalization condition 
n a n a = —1. The construction ensures that n a and are everywhere orthogo- 
nal. 

The hypersurface 38 is foliated by the two-surfaces St- We put coordinates z l 
on 38, and introduce the tangent vectors ef = dx a jdz 1 . The induced metric on 38 
is then given by 

7«=flo i9 efef. (4-2.24) 
Its inverse is 7^, and the completeness relations take the form 

fl a ' 3 = rV+7 < 'efeJ. (4.2.25) 

While the coordinates z l are a priori arbitrary, the choice z l = (t,6 A ) is clearly 
convenient. In these coordinates, 

= Nn a dt + e a A dd A , 

where Eqs. (4.2.18) and (4.2.23) were used. For displacements within 38, the line 
element is 

ds% = g a pdx a dx® 

= 9ap {Nn a dt + e a A d6 A ) (Nn dt + e% d6 B ) 
= {9apn a n p )N 2 dt 2 + {g a pe a A e p B ) d6 A d9 B , 
where the relation n a e A = was used. We have obtained 

jij dz i dz j = -N 2 dt 2 + oab de A d6 B . (4.2.26) 

This implies ^—7 = Ny/a, where 7 and a are the determinants of 7^ and gab , 
respectively. 

Finally, we let J^j be the extrinsic curvature of 38 embedded in spacetime. This 
is given by 

<%i j =r a ;pe?e f3 j , (4.2.27) 

because r a , the unit normal to the two-surfaces St, is also normal to 38. We will 
use to denote its trace: Jff = -y^^ij- 

Table 4.1 provides a list of the various geometric quantities introduced in this 
subsection. 



Table 4.1: Geometric quantities of T, t , St, and 



Surface T, t 



Unit normal n a r a r a 

Coordinates y a 8 A z l 

Tangent vectors e" e\ ef 

Induced metric gab "fij 

Extrinsic curvature Uab ^ij 
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4-2.5 Gravitational action 

As a first step toward constructing the gravitational Hamiltonian, we must subject 
the gravitational action Sq to the 3 + 1 decomposition described in Sec. 4.2.2. Our 
starting point is Eq. (4.1.12), 

16tt5 g = / RV=g~d i x + 2<( eK^ 1 / 2 d 3 y, 
Jy Joy 

where the subtraction term I is omitted for the time being; it will be re-instated 
at the end of the calculation. Here, d"V is the closed hypersurface bounding the 
four-dimensional region Y , y a are coordinates on df, h a b is the induced metric, 
K is the trace of the extrinsic curvature, and e = n a n a , where n a is the outward 
normal to d"V . 

Throughout this section, the quantities n a , y a , /i &, and K ab have referred specif- 
ically to the spacelike hypersurfaces T, t , and we need to be more careful with our 
notation. We have seen that f's boundary is the union of two spacelike hypersur- 
faces E tl and S i2 with a timelike hypersurface 3$ (Fig. 4.4): 

df = S t2 u(-£ tl )U#. 

The minus sign in front of T, tl serves to remind us that while the normal to df 
must be directed outward, the normal to S 4l is future-directed and therefore points 
inward. With the notation introduced in the preceding subsection, the gravitational 
action takes the form 

16ttS g = [ RV^g^x - 2 f K\fhd 3 y + 2 f K\fhd 3 y + 2 f X-J^d 3 z, 

and the integration over S tl incorporates the extra minus sign just discussed. 

The region f is foliated by spacelike hypersurfaces T, t on which the Ricci scalar 
is given by (Sec. 3.5.3) 

R = 3 R + K ab K ab - K 2 - 2{n a . p n p - n a n%). a , 

where 3 R is the Ricci scalar constructed from /i &. Using Eq. (4.2.11), which we 
write as ^J^^gd^x = N^/hdtd 3 y, we have that 

/ R^d 4 x = [' dt [ ( 3 R + K ab K ab - K 2 )NVhd 3 y 

-2 I (n a . n fi - n a n fi .g) dT, a . 
Jar 

The new boundary term can be expressed as integrals over S tl , S ta , and 8§. On 
Stj , dT, a = n a \fhd 3 y — this also incorporates an extra minus sign — and 

-2 f (n^n 13 - n a n^) dT, a = -2 f n p J3 Vhd 3 y = -2 f KVhd 3 y. 

We see that this term cancels out the other integral over S tl coming from the 
original boundary term in the gravitational action. The integrals over Sj 2 cancel 
out also. There remains a contribution from on which dT, a = r a ^—'yd 3 z, giving 

-2 f (n a /-TiV.a)rfS a = -2 f n a . p n l3 r a V^d 3 z = 2 [ r a ., /3 n a n /3 y/^d 3 z, 

where we have used n a r a = 0. 
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Gathering the results, we have 



\&-kS g = [' dt [ ( 3 R + K ab K ab - K 2 )NVhd 
Jti is, 

+ 2 f {X + r a . p n a n l3 )^d z z. 
J 88 



? 'v 



We now use the fact that 38 is foliated by the closed two-surfaces St- We substitute 
^f z ^d z z = N^/adtd 2 6 and express Jf" as 

= r a ., p {g ap -r a r% 

so that the integrand becomes 

X +r a ., p n a n p = r a - 3 {g af} - r a r p + n a n p ) 
= r a ., p (a AB e<\e%) 

= & AB kAB 
= k. 

We have used Eqs. (4.2.21) and (4.2.25) in these manipulations. Substituting this 
into our previous expression for the gravitational action, we arrive at 

Sg = lib C dt {L ( 3i? + KabKab ~ R2 ) N ^ d3 y 

+ lj (jfc- k )N^d 2 9^j. (4.2.28) 

We have re-instated the subtraction term, by inserting fc into the integral over St- 
This is justified by the fact that for flat spacetime, the integral over S t vanishes, 
so that the sole contribution to Sq comes from the boundary integral; the ko term 
prevents this integral from diverging in the limit St — > oo, and it ensures that Sq 
vanishes identically for flat spacetime. Thus, 

ko = extrinsic curvature of St embedded in flat space. 

The ko term makes the gravitational action well defined for any asymptotically-flat 
spacetime. For compact spacetime manifolds, this term is irrelevant. 

The matter action should also be subjected to the 3+1 decomposition. Because 
the procedure is straightforward, and because we would do well to keep things as 
simple as possible, we shall omit this step here. In the rest of this section we will 
consider pure gravity only, and put the matter action to zero. 

4-2.6 Gravitational Hamiltonian 
To construct the Hamiltonian, we must express Sg in terms of 

h ab = £ t h ab , (4.2.29) 
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where t a is the timelike vector field defined by Eq. (4.2.9). We calculate this as 
follows. First we recall the definition of the induced metric and write 

Kb = £ t {g a /3e"e^) = (£iSa/3)<ef, 

where we have used Eq. (4.2.7). Equation (4.2.9) implies that the Lie derivative of 
the metric is given by 

£t9a/3 = t a; p + tj3- a 

= (Nn a + N a ), p + (Nnp + N ), a 

= n a N tP + N :Ct np + N(n a -p + np- a ) + N a .p + Np. a , 

where N a = N a e". Finally, projecting this along e"e^ gives 

h ab = 2NK ab + N a]b + N bla , 

where we have used the definitions of extrinsic curvature and intrinsic covariant 
differentiation found in Sec. 3.4. 
We have obtained 

K ab = ^ (h ab - N alb - N bla ) . (4.2.30) 

The gravitational action therefore depends on h ab through the extrinsic curvature. 
Notice that the action does not involve N nor N a , so that momenta conjugate to N 
and N a are not defined. This means that unlike h ab , the lapse and the shift are not 
dynamical variables. This was to be expected: N and N a only serve to specify the 
foliation of f into the spacelike hypersurfaces T, t ; because this foliation is arbitrary, 
we are completely free in our choice of lapse and shift. 
The momentum conjugate to h ab is defined by 




(4.2.31) 



where _£?g is the "volume part" of the gravitational Lagrangian. (The "bound- 
ary part" is independent of h ab .) Because J£q is expressed in terms of K ab , it is 
convenient to write Eq. (4.2.31) in the form 

dh ab oK mn \ J 

where 

Ifoy/^g&G = [ 3 R + (h ac h bd - h ab h cd )K ab K cd ]NVh 
follows from Eq. (4.2.28). Evaluating the partial derivatives gives 

16tt p ab = Vh {K ab - Kh ab ), (4.2.32) 

and we see that the canonical momentum is closely related to the extrinsic curvature. 
The "volume part" of the Hamiltonian density is 

JffG=P ab K b -V^9^G. (4.2.33) 

Using our previous results, we have 

167rJ^ G = Vh(K ab -Kh ab )(2NK ab +N alb + N bla ) 

- ( 3 R + K ab K ab - K 2 )NVh 

= (K ab K ab -K 2 - 3 R)NVh + 2(K ab - Kh ab )N a]b Vh 

= (K ab K ab -K 2 - 3 R)NVh + 2 [(K ab - Kh ab )N a ] {b Vh 

-2(K ab - Kh ab ) ]b N a Vh. 
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The full Hamiltonian is obtained by integrating J^q over S t and adding the bound- 
ary terms: 

16nH G = [ 16tt Mb d 3 y - 2 <£ (k - k )N^d 2 6 

J^t JSt 

= J [N(K ab K ab -K 2 - 3 R)-2N a (K ab -Kh ab ) lb \Vhd 3 y 
+ 2 I {K ab - Kh ab )N a dS b -2(f (k- k )N^d 2 6. 

JS t JSt 

Writing dS b = r b \fa d 2 6, the gravitational Hamiltonian becomes 

16nH a = f \N{K ab K ab — K 2 — 3 R) — 2N a {K ab — Kh ab )\ b Vhd 3 y 

-2 I \N(k - ko) - N a {K ab - Kh ab )r b ] V^d 2 6. (4.2.34) 
Js t L J 

It is understood that here, K ab stands for the function of h ab and p ab defined by 
Eq. (4.2.32); this is given explicitly by 

VhK ab = 16tt (p ab - i ph ab ^j , (4.2.35) 



where p = h ab p' 



,o6 



4-2.7 Variation of the Hamiltonian 



The equations of motion for the gravitational field are obtained by varying the 
action of Eq. (4.2.28) with respect to N, N a , h ab , and p ab , which are all treated as 
independent variables. The variation is restricted by the conditions 

SN = 6N a = 6h ab = on S u (4.2.36) 

but there is no requirement that 6p ab vanish on the boundary. As a preliminary 
step toward calculating SSq, we shall now carry out the variation of Hq. The 
computations presented here are rather formidable; the punch line is delivered in 
Eq. (4.2.46). 

We begin with a variation with respect to both N and N a . Taking Eq. (4.2.36) 
into account, Eq. (4.2.34) gives 

16tt5 n H g = [ (-C6N -2C a 6N a )Vhd 3 y, (4.2.37) 

where 

C = 3 R + K 2 -K ab K ab , C a = {K b -K5 b )\ b . (4.2.38) 

This was easy; the remaining variations will require considerably more labour. 

To carry out a variation with respect to h ab or p ab , we must express Hq in 
terms of these variables, instead of h ab and K ab as was done in Eq. (4.2.34). Using 
Eq. (4.2.35), a few steps of algebra give 

16ttH g = Hx+H s , (4.2.39) 

where 



= I [Nh- 1 / 2 (p ab p ab - \p 2 ) - Nh 1 / 2 3 R - 2N a h 1/2 (h~ 1/2 p ab ) , J d 3 y (4.2. 



40) 
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is the "bulk" term, while 

H s = -2<£ [jV(* - ko) - ^ a /i- 1/2 p o6 r 6 ] V^d 2 6 (4.2.41) 

is the "boundary" term. We have introduced the notation Hsigma = 16nH-z, 
p ab = 167rp° 6 , and so on; this usage was anticipated in Eqs. (4.2.38). 
We first vary Hq with respect to p ab . From Eq. (4.2.40) we have 

5 P H^= I Nh- 1 /%{p ab p ab -±p 2 )d 3 y-26 p f N a {h~^ 2 p ab ) {b h 1/2 d 3 y. 

We substitute 

S P {p ab p ab - \p 2 ) = 2{p ab - \ph ab )5p ab 
inside the first integral, and we integrate the second by parts. This gives 

6 p Hv = J^2[Nh-^ 2 {p ab -^ph ab )+N {alb) ]dp ab d 3 y 

- 2 I N a h- 1 / 2 5p ab r b V^d 2 6. 
J s t 

The boundary term is precisely equal to (minus) the variation of Hs- We therefore 
have obtained 

S P H G = [ H ab 6p ab d 3 y, (4.2.42) 

U ab = 2Nh~ 1 / 2 (p ab - ^ph ab ^j + 2N {alb) . (4.2.43) 

To vary Hq with respect to h ab is more labourious, and we will rely on compu- 
tations already presented in Sec. 4.1. We begin with the bulk term: 

5 h Hv = £ [-Nh- 1 {p ab p ab - \f)5 h h}l 2 + Nh-^Sn {p ab p ab - \p 2 ) 
- N5 h {h}-I 2 3 i?)] d 3 y - 25 h j N a h~ x l 2 p ab r b \[o d 2 Q 



where 



+ 2S h f N a]b p ab d 3 y, 



in which the last term on the right-hand side of Eq. (4.2.40) was integrated by parts. 
The variation of the integral over St vanishes because h ab is fixed on the boundary. 
In the first term within the integral over S t we substitute 

6 h h 1 / 2 = 1 -h 1 / 2 h ab 6h ab , 

while in the second term, 

Sh{P ab Pab ~ \P 2 ) = 2(p a J cb - \pp ab )5h ab . 

In the third term, we use the three-dimensional version of Eq. (4.1.16), 

Snh 1 ' 2 3 R = -h^ 2 G ab Sh ab + h 1 ' 2 dv c {c , 

where G ab = R ab — 1 3 R h ab is the three-dimensional Einstein tensor, and dv° = 
h ab 6T c ab - h ac 6T b ab . Finally, in the last term we substitute 

6 h N a{b = N c {b 6h ac + h ac N d 6T c bd . 
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After a few steps of algebra, we obtain 

ShHe = f \-^Nh-^ 2 (p cd p cd - \f)h ab + 2Nh- 1 l 2 {p a c p bc - \pp ab ) 

+ Nh}' 2 G ab + 2p< a N b) ^ 6h ab d 3 y 

+ £ [-Nh 1 ' 2 dv c ]c + 2p b c N d 5T c bd ] d 3 y. 

We now leave the first integral alone, and set to work on the second integral, 
beginning with the first term. After integrating by parts, 

- f Nh 1 / 2 dv c ]c d 3 y = f N^h 1 ' 2 d 3 y - I NSv c r c ^/d d 2 6 

= [ NJv c h 1 l 2 d 3 y + <f Nh ab 5h ab , c r c ^d 2 8, 
Jv, Js t 

where the three-dimensional version of Eq. (4.1.17) was used. To express the first 
integral in terms of 6h ab , we use the relation 

<5r c a6 = \ h cd [(Sh da ) lb + (5h db ) la - (5h ab ) ld ] , 

which is easy to establish. (Note that the covariant derivative is defined with respect 
to the reference metric h ab , about which the variation is taken.) We have 

dv c = i (h ab h cd - h ac h bd ) [(6h da ) ]b + (6h db ) ]a - (Sh ab ) ]d ] 

and then 

N c dv c = ^{h ab N> d -N> a h bd )[(6h da ) lb + (dh db ) ]a -(dh ab ) ld ] 

= -(h ab N' d -N' a h bd )(dh ab ) ld ; 

the second line follows by virtue of the antisymmetry in a and d of the first factor. 
After another integration by parts we obtain 

-/ Nh}' 2 dv%d 3 y = [ {h ab N^ d d -N\ ab )5h ab h}/ 2 d 3 y 

+ I Nh ab 5h ab , c r c V^d 2 8, 
J s t 

where we have used the fact that 6h ab vanishes on St- All this takes care of the first 
term inside the second integral for ShHz- We now turn to the second term inside 
the same integral. We have 

/ 2p b c N d 5Y\ d d 3 y = f p ab N d [(6h ab ) ]d + (6h ad ) lb -(6h bd ) la ]d 3 y 

= f h- 1 l 2 p ab N d (5h ab \ d h 1 l 2 d 3 y 

= - f (h-^p^N^Sh^d'y, 

where we have integrated by parts and put 6h ab = on St- 

Gathering the results, we find that the variation of the bulk term is 

S h Hv= [ V ab 6h ab d 3 y+ I Nh ab dh abtC r c V^d 2 e, 
J s t J s t 
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where P ab will be written in full below. On the other hand, variation of the bound- 
ary term gives 

5 h H s = -2<f N5kV^d 2 6, 
J s t 

and 6k = \h at '8h a ^ c r c is the three-dimensional analogue of a result previously 
derived in Sec. 4.1.5. Thus, 

S h H s = -<f Nh ab dh abtC r c V^d 2 e, 
J s t 

and this cancels out the boundary integral in ShHy.- The variation of the full 
Hamiltonian is therefore 



S h H G 



= f V ab 6h al 



d 3 y, 



(4.2.44) 



where 



V ab = Nh 1/2 G ab - \Nh- 1/2 (p cd p cd - \p 2 )h ab + 2Nh- 1/2 (p a c p bc - \pp ab ) 

- h 1 / 2 (7Vl° 6 - h ab N\\) - h 1 / 2 [h' 1 / 2 p ab N c ) + 2p< a N b) w (4.2.45) 

Here, as before, G ab = R ab — 1 3 Rh ab is the three-dimensional Einstein tensor. 

Combining Eqs. (4.2.37), (4.2.42), and (4.2.44), we find that the complete varia- 
tion of the gravitational Hamiltonian, under the conditions of Eq. (4.2.36), is given 

by 

SH G = J (v ab Sh ab +n ab dp ab -C6N -2C a 6N a ^)d 3 y, (4.2.46) 

where V ab = V ab /(16n) is given by Eq. (4.2.45), % ab by Eq. (4.2.43), while C = 
C/(16tt) and C° = C°/(16tt) are given by Eq. (4.2.38). 

4-2.8 Hamilton's equations 
The equations of motion are obtained by varying the gravitational action, expressed 

ta 



as 



S a = f'dt [ p ab h ab d 3 y-H G 
Jti Uzt 



with respect to the independent variables N, N a , h ab , and p ab . Variation yields 

6S G = [ t2 dt\[ (p ab 6h ab + h ab 6p ab )d 3 y-6H G ], 

where 6H G is given by Eq. (4.2.46). After integrating the first term by parts, we 
obtain 

6S G = f'dt f [(hob - n ab ) 6p ab - (p ab + V ab ) 6h ab +C6N + 2C a 6N a ~\ d 3 y. 

(4.2.47) 

Demanding that the action be stationary implies 



h a b — Hab, P 



•ab 



-V ab , 



o, 



C a = 0. 



(4.2.48) 



These are the vacuum Einstein field equations in Hamilton form. The first two 
govern the evolution of the conjugate variables h a b and p ab ; it is easy to check that 
h a b = H. a b just reproduces the relation between h ab and p ab implied by Eqs. (4.2.30) 
and (4.2.35). The last two are the constraints equations first derived in Sec. 3.6; 
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the relations C = and C a = are usually referred to as the Hamiltonian and 
momentum constraints of general relativity, respectively. 

The Hamiltonian formulation of general relativity suggests the following strategy 
for solving the Einstein field equations. First, select a foliation of spacetime into 
spacelike hypersurfaces by specifying the lapse N and the shift N a ; the choice of 
foliation is completely arbitrary. Defining h ab to be the induced metric on the 
spacelike hypersurfaces, the full spacetime metric is given by Eq. (4.2.10): 

ds 2 = -N 2 dt 2 + h ab (dy a + N a dt)(dy b + N b dt). (4.2.49) 

Next, choose initial values for the tensor fields h ab and K ab , where K ab is the 
extrinsic curvature of the spacelike hypersurfaces. This choice is not entirely arbi- 
trary, because the constraint equations must be satisfied: the initial values must be 
solutions to 

3 R + K 2 - K ab K ab = 0, (K ab - Kh ab ) lb = 0, (4.2.50) 

where 3 R is the Ricci scalar associated with h ab , and K = h ab K ab . Finally, evolve 
these initial values using Hamilton's equations, h ab = T-L ab and p ab = —V ab , which 
may be written in the form (Sec. 4.5, Problem 4) 

h ab = 2NK ab + £ N h ab (4.2.51) 

and 

Kab = N lab - N(R ab + KK ab - 2K c a K bc ) + £ N K ab . (4.2.52) 

In these equations, the Lie derivatives are directed along N a , the shift vector. This 
formulation of the field equations, usually referred to as their 3 + 1 decomposition, 
is the usual starting point of numerical relativity. 

4-2.9 Value of the Hamiltonian for solutions 

We now return to Eq. (4.2.34) and ask: What is the value of the gravitational 
Hamiltonian when the fields h ab and K ab satisfy the vacuum field equations (4.2.50)- 
(4.2.52)? The answer is that by virtue of the constraint equations, only the bound- 
ary term contributes to the solution- valued Hamiltonian: 

^solution = _ J_ I I N ( k _ fco ) _ Na ( K *b _ Kh ab )r b ]Va r d 2 6. (4.2.53) 
Sir J St i J 

As was discussed previously, this boundary term is relevant only when the spacetime 
manifold is noncompact. For compact manifolds, jjg> lutl ° n = 0. The physical 
significance of jjg> lutl0n for asymptotically-flat spacetimes will be examined in the 
next section. 

4.3 Mass and angular momentum 

4-3.1 Hamiltonian definitions 

It is natural to expect that the gravitational mass of an asymptotically-flat space- 
time — its total energy — should be related to the value of the gravitational Hamil- 
tonian for this spacetime. We will explore this relation in this section, and motivate 
another between the Hamiltonian and the spacetime's total angular momentum. 

The solution- valued Hamiltonian, jjg' 1 "* 1011 given by Eq. (4.2.53), depends on 
the asymptotic behaviour of the spacelike hypersurface £t, and on the asymptotic 
behaviour of the lapse and shift. While the lapse and shift are always arbitrary, the 
fact that the spacetime is asymptotically flat gives us a preferred behaviour for the 
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hypersurfaces. We shall demand that T, t asymptotically coincide with a surface of 
constant time in Minkowski spacetime: If (i, x, y, z) is a Lorentzian frame at infin- 
ity, then the asymptotic portion of S 4 must coincide with a surface t = constant. 
In this portion of T, t , the (arbitrary) coordinates y a are related to the spatial 
Minkowski coordinates, and we have the asymptotic relations y a — > y a (x,y,z); 
similarly, x a — > x a (i, x,y, z). We note that t is proper time for an observer at rest 
in the asymptotic region, and infer that this observer moves with a four-velocity 
u a = dx a /di. Because this vector is normalized and orthogonal to the surfaces 
i = constant, it must coincide with the normal vector n a , and we have another 
asymptotic relation, n a — > dx a /di. Substituting this into Eq. (4.2.9) gives us 

an asymptotic relation for the flow vector. This shows that once the asymptotic 
behaviour of £t has been specified, there is a one-to-one correspondence between 
a choice of lapse and shift and a choice of flow vector. The solution- valued Hamil- 
tonian can then be regarded either as a function of N and N a , or as a function of 
t a . 

We shall define M, the gravitational mass of an asymptotically-flat spacetime, 
to be the limit of jjg> lutl0n when St is a two-sphere at spatial infinity, evaluated 
with the following choice of lapse and shift: N = 1 and N a = 0. From Eq. (4.2.53), 

M = -^- lim I (k-k )V^d 2 6. (4.3.1) 

07T S,->oo Jg t 

Here, oab is the metric on St, k = o AB kAB is the extrinsic curvature of St embedded 
in X t , and ko is the extrinsic curvature of St embedded in flat space. The quantity 
defined by Eq. (4.3.1) is called the ADM mass of the asymptotically-flat spacetime; 
the name refers to the seminal work by Arnowitt, Deser, and Misner. 

The choice N = 1, N a = implies that asymptotically, t a — > dx a /di, so 
that the flow vector generates an asymptotic time translation. The ADM mass 
is then just the gravitational Hamiltonian for this choice of flow vector, and we 
have made a formal connection between total energy and time translations. This 
connection is both deep and compelling, and it can be adapted to give a definition 
of total angular momentum. Indeed, the gravitational Hamiltonian should provide 
a similar connection between angular momentum and asymptotic rotations, which 
are characterized by t a — > <f> a = dx a /dcj), where 4> is a rotation angle defined in the 
asymptotic region in terms of the Cartesian frame (x, y, z). This corresponds to the 
choice N = 0, N a = 4> a = dy a /d4> of lapse and shift. 

We shall define J, the angular momentum of an asymptotically-flat spacetime, 
to be (minus) the limit of jjg' 1 "* 1011 when St is a two-sphere at spatial infinity, 
evaluated with N = and N a = 4> a . From Eq. (4.2.53), 

j = __L i im I (K ab -Kh ab )4> a r b V^d 2 8. (4.3.2) 

»7T S,-s>oo < / S( 

A minus sign was inserted to recover the usual right-hand rule for the angular 
momentum. Notice that this definition of angular momentum refers to a specific 
choice of rotation axis, and <j> is the angle around this axis. 

4-3.2 Mass and angular momentum for stationary, axially symmetric 

spacetimes 

To show that these definitions are in fact reasonable, we shall calculate M and J 
for an asymptotically-flat spacetime that is both stationary and axially symmetric. 
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In the asymptotic region r ra, the metric of such a spacetime can be expressed 
as 

ds* = _ ^ _ df 2 + ^ + ^ ( dr 2 + r 2 dfi 2) _ 4jsn^ ^ (4 3 3) 

where m and j are the spacetime's mass and angular-momentum parameters, re- 
spectively. We will show that M = m and J = j, and confirm that the Hamiltonian 
definitions are well founded. We note that the validity of this metric in the asymp- 
totic region could always be used to define mass and angular momentum. Our 
Hamiltonian definitions are more powerful, however, because they do not involve a 
particular coordinate system, and they stay meaningful even when the spacetime is 
not stationary nor axially symmetric. 

We choose the hypersurfaces £t to be surfaces of constant t, and n a = — (1 — 
m/r)d a t is the unit normal. (Throughout this calculation we work consistently to 
first order in ra/r.) The induced metric on T, t is given by 



h ab dy a dy b = (l + ^ {dr 2 + r 2 dtf). 



The boundary St is the two-sphere r = R, and the limit R — > oo will be taken at 
the end of the calculation. Its unit normal is r a = (1 + m/r)d a r, and 



r 2m\ 

( 1 + ir> 



a AB d8 A d6 ±s = ( 1 + ^ \ R 2 dti 2 



gives the two-metric on St- 

To evaluate M we must first calculate k. This is given by k = r°| and a brief 
calculation yields k = 2(1 — 2m/R)/R. To this we must subtract ko, the extrinsic 
curvature of a two-surface of identical intrinsic geometry, but embedded in flat 
space. On this surface, 

a%de A de B = r' 2 dn 2 , 

where R' = R(l+m/R), so that a% = u A b- We have k = 2/R' = 2(l-m/R)/R, 
and simple algebra yields k — ko = —2m/R 2 . On the other hand, \fa d 2 6 = R 2 (l + 
2m/R) sin6d8d(j), and substitution into Eq. (4.3.1) yields 

M = m, (4.3.4) 

the desired result. 

To evaluate J we must first calculate K a i,<j) a r b = K<j, r (l — m/r), where = 
n a -,p e-a e b- (The second term in the integrand, Kh a i ) <j) a r b , vanishes identically.) The 
relevant component of the extrinsic curvature is K<p r = (1 — m/r)r^ r . Using 

2m \ _w 2 i 



»" = -! + —, 9** = -^, 
\ r ) r 6 

we find that r*^ r = — 3j sin 2 6/r 2 , and this gives K^ r = —3j sin 2 6/R 2 . Substituting 
this into Eq. (4.3.2) yields J = (3j/4) / sin 3 6 d6, or 

J = j, (4-3.5) 



the desired result. 
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4-3.3 Komar formulae 

An appealing feature of the Hamiltonian definitions for mass and angular momen- 
tum is that they do not involve a specific choice of coordinates. Alternative defini- 
tions that share this property can be produced for stationary and axially symmetric 
spacetimes. These are known as the Komar formulae, and they are 

M=~ lim I V a ^ (t) dS aP (4.3.6) 



and 

Here, £^ is the spacetime's timelike Killing vector, and £fy is the rotational Killing 
vector; they both satisfy Killing's equation, £ a -p +£p- a = 0. The surface element is 
given by (Sec. 3.2) 

dS a/ 3 = -2n [a r p] Vad 2 e, (4.3.8) 

where n a and r a are the timelike and spacelike normals to St, respectively. 

To establish that these formulae do indeed give M = m and J = j, we must 
prove that for the spacetime of Eq. (4.3.3), the relations 

-2V a ^ t) n a r p = -2m/r 2 = *-*„, V a ^n a r^ = K ah $ a r b 

hold in the limit r — > oo. We begin with the first relation: 

-2V a e (t) n a r p = 2^n a rP 

= 2T<* Pl n a r?q t) 
= -2(1 - 2m/r)T t rt 
= -2m/r 2 , 

as required. We have used Killing's equation in the first line, and inserted n a = 
— (1 — m/r)d a t, r a = r a e" = (1 — m/r)dx a /Or, and = m(l + 2m/r)/r 2 in the 
following steps. To prove the second relation requires less work: 

V a ^n«^ = ~^ m n a r p 

= n a , p {r e«)(r 6 ef) 
= (n a .,pe a y b )<f> a r b 
= K ab cf> a r b . 

Once again, Killing's equation was used in the first line. The second line follows 
from the fact that the Killing vector is orthogonal to n a . In the third line, the 
vectors £^ and r 13 were decomposed into the basis e". Finally, the last line follows 
from the definition of the extrinsic curvature. These computations prove that the 
definitions of Eq. (4.3.6) and (4.3.7) do indeed imply M = m and J = j. We 
see that for stationary and axially symmetric spacetimes, the Komar formulae are 
equivalent to our Hamiltonian definitions for mass and angular momentum. 

The Komar formulae can be turned into hypersurface integrals by invoking 
Stokes' theorem (Sec. 3.3), 



j> B aP dS aP = 2 j B ap . p dT, a , 
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where B" 13 is any antisymmetric tensor field, and S is the two-dimensional boundary 
of the hypersurface X. This is possible because when is a Killing vector, the 
tensor B al3 = V a £^ is necessarily antisymmetric. We have 

B ap , p = (v a ^) ;/3 = -(v^ a )* = 

where □ = V a V a . Using the fact that all Killing vectors satisfy = —R a p£? 
(Sec. 1.13, Problem 9), we have established the identity 

Because the hypersurface £ is spacelike, we have that dT, a = —n a Vhd 3 y. Using 
the Einstein field equations, we then obtain 

j^ a edS aP = IJr al) -^Tg a ^n a f^d 3 y. 

Finally, combining this with Eqs. (4.3.6) and (4.3.7), we arrive at 

M = 2 £ (r aP - l - Tg a ^jn a f [t) Vhd 3 y (4.3.9) 

and 

J=- f^~\ n a f w Vhd 3 y. (4.3.10) 

In these equations, £ stands for any spacelike hypersurface that extends to spatial 
infinity. If £ had two boundaries instead of just one, then an additional contribution 
from the inner boundary would appear on the right-hand side of Eqs. (4.3.9) and 
(4.3.10). Such a situation arises when the stationary, axially symmetric spacetime 
contains a black hole (see Sec. 5.5.3). 

It is a remarkable fact that M and J are defined fundamentally in terms of 
integrals over a closed two-surface at infinity. These quantities should therefore be 
thought of as properties of the asymptotic structure of spacetime. It is only in the 
case of stationary, axially symmetric spacetimes that M and J can be defined as 
hypersurface integrals. 

4.3.4 Bondi-Sachs mass 

The ADM mass was constructed in Sec. 4.3.1 by selecting a closed two-surface 
St = S(t, r), integrating k — ko over this surface, and then taking the limit r — > 00. 
Thus, 

MadmW = -jt { (* - k )V^d 2 6. (4.3.11) 

87r JS(t,r-+oo) 

Here, S(t, r) denotes a surface of constant t and r which becomes a round two- 
sphere of area 4nr 2 as r — > 00. This limit, which is taken while keeping t fixed, is 
what defines "spatial infinity" . 

There exists another way of reaching infinity, and to this new limiting procedure 
corresponds a distinct notion of mass. This is the Bondi-Sachs mass, which is 
obtained by taking S(t, r) to "null infinity" instead of spatial infinity. To define 
this we introduce the null coordinates u = t — r (retarded time) and v = t + r 
(advanced time). In these coordinates, a two-surface of constant t and r becomes a 
surface of constant u and v, which we denote S(u,v). Null infinity corresponds to 
the limit v — > 00 keeping u fixed, and the Bondi-Sachs mass is defined by 

M BS (u) = --L / (jfe - k )V^d 2 6. (4.3.12) 

OTT Js(u,v^oo) 
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The physical importance of the Bondi-Sachs mass comes from the fact that when an 
isolated object emits radiation (in the form, say, of electromagnetic or gravitational 
waves), the rate of change of Mbs(w) is directly related to the outward flux of 
radiated energy. If F denotes this flux, then the Bondi-Sachs mass satisfies 



dM BS 
du 



= -<f F^/b r d 2 6. (4.3.13) 

J S(u,v— >co) 

Thus, the mass of a radiating object decreases as the radiation escapes to infinity. 
The proof of this statement is rather involved; it can be found in the original papers 
by Bondi, Sachs, and their collaborators. 

4-3.5 Distinction between ADM and Bondi-Sachs masses: Vaidya 

spacetime 

For stationary spacetimes, the ADM and Bondi-Sachs masses are identical: there is 
no distinction. For the dynamical spacetime of an isolated body emitting gravita- 
tional (or other types of) radiation, the two notions of mass are distinct. For such a 
system, the Bondi-Sachs mass decreases according to Eq. (4.3.13), while the ADM 
mass stays constant. 

The metric of a radiating spacetime is difficult to write down; usually it is 
expressed as a messy expansion in powers of 1/r. We shall not attempt to deal with 
these complications here. For the purpose of illustrating the difference between 
the ADM and Bondi-Sachs masses, we shall instead adopt a simple spherically- 
symmetric model. Consider the Schwarzschild metric expressed in terms of the null 
coordinate u = t — r — 2Mln(r/2M — 1), and allow the mass parameter M to become 
a function of retarded time: M — > m(u). This new metric is given by 

ds 2 = -fdu 2 -2dudr + r 2 dn 2 , f = 1 - 2m(u)/r, (4.3.14) 

and it is a good candidate to represent a radiating spacetime. To see if it makes a 
sensible solution to the Einstein field equations, let us examine the Einstein tensor, 
whose only nonvanishing component is G uu = —(2/r 2 )(dm/du). This means that 
the stress-energy tensor must be of the form 

dm/du ,.„,, 
T «/3 = ~~^a~ lal ^ (4.3.15) 

where l a = —d a u is tangent to radial, outgoing null geodesies. This stress-energy 
tensor describes a pressureless fluid with energy density p = (— dm / du) / (4nr 2 ) mov- 
ing with a four- velocity l a . Such a fluid is usually referred to as null dust; it gives 
a good description of high-frequency radiation. It is easy to check that the form 
(function of u) / r 2 for the energy density is dictated by energy- momentum conser- 
vation. You may also verify that all the standard energy conditions are satisfied 
by T a p if dm/du < 0, that is, if m decreases with increasing retarded time. We 
conclude that the metric of Eq. (4.3.14), called the outgoing Vaidya metric, makes 
a physically reasonable solution to the Einstein field equations. 

We wish to compute the ADM and Bondi-Sachs masses for the Vaidya spacetime. 
The first step is to select a spacelike hypersurface £ bounded by a closed two-surface 
S; this hypersurface must asymptotically coincide with a surface i = constant of 
Minkowski spacetime. A suitable choice is to let £ be a surface of constant t = u+r, 
for which the unit normal 



n a 



-(2- f)-^ da (u + r) 



is everywhere timelike. From Eq. (4.3.14) we obtain that the induced metric on £ 
is 

h ab dy a dy b = (2 - /) dr 2 + r 2 dtt 2 . 
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For S we choose the two-sphere r = R, where R is a constant much larger than the 
maximum value of 2m(u); eventually we will take the limit R — > oo. Recall that 
spatial infinity corresponds to keeping t fixed while taking the limit (which means 
that u — > — oo), whereas null infinity corresponds to keeping u fixed while taking 
the limit. The unit normal on S is r a = (2 — /) 1//2 d a r, and the induced metric is 
a AB de A de B =R 2 dn 2 . 
First we calculate 



M{S) = -^- I (k - k )V^ d 2 6 

87T J s 



for the bounded two-surface S; the two different limits to infinity will be taken next. 
The extrinsic curvature of S embedded in £ is calculated as 



2m(u) 

1 + 



R 



-i/2 ^ 2 
~ R 



l_^ +0(ir 2 ) 



and the extrinsic curvature of S embedded in flat space is ko = 2/R. Subtracting, 
we have that k — ko = —2m(u)/R 2 + 0(R~ 3 ), and integrating over S yields M(S) = 
m{u) +0{R~ 1 ). 

We may now take the limit R -> oo. As was mentioned previously, the ADM 
mass is obtained by keeping t = u + R fixed while taking the limit. This gives 

ATadmW =m(-oo), (4.3.16) 

and we see that the ADM mass is a constant, equal to the initial value of the 
mass function. We may therefore say that Madm represents all the mass initially 
present in the spacetime. (This interpretation is quite general and not limited to 
this specific example.) For the Bondi-Sachs mass, we must keep u fixed while taking 
the limit. This gives 

M BS (u)=m(u), (4.3.17) 

and we see that the Bondi-Sachs mass is identified with the mass function of the 
Vaidya spacetime. It decreases in response to the outflow of radiation described by 
the stress-energy tensor of Eq. (4.3.15). Notice that the field equation 

P = -4nr 2 T uu = -4nr 2 (-T r u ) = -4nr 2 F 

is compatible with the general mass-loss formula, Eq. (4.3.13). 

It may appear paradoxical that the ADM mass of a dynamical spacetime should 
be a constant. This, however, is what should be expected of a radiating spacetime 
(Fig. 4.5). The ADM mass represents all the mass present on a spacelike hypersur- 
face of constant t. This hypersurface intersects the central object whose mass does 
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decrease as a consequence of radiation loss. But this does not mean that the ADM 
mass should decrease, because the hypersurface intersects also the radiation, and 
the ADM mass accounts for both forms of energy. The net result is a conserved 
quantity. On the other hand, the Bondi-Sachs mass represents all the mass present 
on a null hypersurface of constant u. Because this hypersurface fails to intersect 
any of the radiation that was emitted prior to the retarded time u, the net result is 
a quantity that decreases with increasing retarded time. 

4-3.6 Transfer of mass and angular momentum 

We shall now derive expressions for the transfer of mass and angular momentum 
across a hypersurface £ in a stationary, axially symmetric spacetime. 
To begin, consider the vector fields 

e a = -T%e (tV e a = T a p^ V (4-3.18) 

where T a/3 is a test stress-energy tensor that does not influence the spacetime ge- 
ometry. From the definition of the stress-energy tensor, e a can be interpreted as an 
energy-density flux vector, while l a is interpreted as an angular-momentum-density 
flux vector. 

To see this clearly, consider the simple case of dust, a perfect fluid with stress- 
energy tensor T a/3 = pu a vP , where p is the rest-mass density, and u a the four- 
velocity. Energy-momentum conservation implies that u a satisfies the geodesic 
equation, and that j a = pu a is a conserved vector: j a a = 0. This vector can be 
interpreted as the dust's momentum density, or equivalently, as a rest- mass flux 
vector. Then e a = Ej a and l a = Lj a , where E = — u a £^ is the conserved energy 

per unit rest mass, and L = u a £"^ the conserved angular momentum per unit rest 

mass. (As we have indicated, both E and L are constants of the motion.) These 
relations show quite clearly that e a represents a flux of energy density, while £ a is 
a flux of angular-momentum density. 

The vectors e a and i a are divergence-free. For example, 

the first term vanishes by virtue of energy-momentum conservation, and the second 
vanishes because £(t)/3 ;a is an antisymmetric tensor field. This implies that the 
integral of e a or l a over a hypersurface d"V enclosing a four-dimensional region V 
is identically zero. For example, 

I e a dT, a = 0. 
Joy 

This equation states that the total transfer of energy across a closed hypersurface 
df is zero. This is clearly a statement of conservation of total energy — or total 
mass. 

The boundary d"V can be partitioned into any number of pieces. If one such 
piece is the hypersurface S, then the integral of e a over S represents the mass 
transferred across this piece of df. Thus, 



AM = - 




(4.3.19) 



is the mass transferred across the hypersurface S, and similarly, 




(4.3.20) 
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is the angular momentum transferred across S. 

For illustration, let us return to our previous example, and let us choose £ to be 
spacelike and orthogonal to the vector field u a . Then dT, a = —u a Vhd 3 y, and we 
find that AM = J^Ep\/hd 3 y and A J = f^Lp\fhd 3 y. The first equation states 
that the transfer of energy across £ is the integral of Ep, the energy density. The 
second equation comes with a very similar interpretation. 



4.4 Bibliographical notes 

During the preparation of this chapter I have relied on the following references: 
Arnowitt, Deser, and Misner (1962); Bondi, van der Burg, and Metzner (1962); 
Brown and York (1993), Brown, Lau, and York (1997); Carter (1979); Hawking 
and Horowitz (1996); Sachs (1962); Sudarsky and Wald (1992); and Wald (1984). 
More specifically: 

An overview of the Lagrangian and Hamiltonian formulations of general rela- 
tivity is given in Appendix E of Wald. The Hamiltonian formulation was initiated 
by Arnowitt, Deser, and Misner, who also introduced the ADM mass. Early treat- 
ments of the Hamiltonian formulation often discarded the all-important boundary 
terms; careful treatments are given in Sudarsky and Wald, Brown and York, and 
Hawking and Horowitz. (Problem 7 below is based on this last paper.) The Hamil- 
tonian definitions for mass and angular momentum are taken from Brown and York; 
the discussion of Sec. 4.2.4 is also based on their paper. Sections 4.3.3 and 4.3.5 
are based on Carter's Sees. 6.6.1 and 6.6.2, respectively. The Bondi-Sachs mass 
was introduced by Bondi and his collaborators in an effort to put the notion of 
gravitational-wave energy on a firm footing. The definition given in Sec. 4.3.4 is 
due to Brown, Lau, and York. I would like to point out that the first occurrence 
of the (k — ko) formula for the ADM mass can be found in a 1988 paper by Katz, 
Lynden-Bell, and Israel. 



4.5 Problems 

1. The Lagrangian density for the free electromagnetic field is 

<g — — F a/3 F a 

167T aP ' 

where F a p = Ap. a — A a .p is the Faraday tensor, expressed in terms of the 
vector potential A a . 

a) Derive the Maxwell field equations for vacuum, F a ^.^ = 0, on the basis of 

this Lagrangian density. 

b) Show that the stress-energy tensor for the electromagnetic field is given 

by i i 

2. The Lagrangian density for a point particle of mass m moving on a world line 
z a (A) is given by 

Jif = — m J \J—g a pz a zP 5±{x, z) dX, 
where 5a{x,x') is a four-dimensional, scalarized (^-function satisfying 

/ 5i{x,x')y/^ ~gd A x = 1 
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if x' is within the domain of integration; we also have z a = dz a /d\, and the 
parameterization of the world line is arbitrary. 

a) Derive an expression for the stress-energy tensor of a point particle. To 

simplify this expression, set dX = dr (with r denoting proper time on 
the world line) at the end of the calculation. 

b) Prove that when it is applied to a point particle, the statement T"^.^ = 

gives rise to the geodesic equation for u a = dz a /dr. 

c) Explain whether the result of part b) constitutes a valid proof of the state- 

ment that the Einstein field equations predict the motion of a massive 
body to be geodesic. 

3. Calculate the gravitational action Sq for a region f of Schwarzschild space- 
time. Take f to be bounded by the hypersurfaces S^, St a , £.r, and T, p , 
where (£t a ) is the spacelike hypersurface described by t = fa (t = fa), and 
where £_r (E p ) is the three-cylinder at r = R (r = p). Here, 2M < p < R. 
At the end of the calculation, take the limits R — > oo and p — > 2M. 

4. Derive Eq. (4.2.52), the evolution equation for the extrinsic curvature. You 
may use p ab = —V ab as a starting point, or proceed from scratch with the 
definition = £t(n a; pe"e^). [Either way, the calculation is tedious! You 
may want to consult York (1979).] 

5. Recall that in Sec. 3.6.5 we introduced a mass function m(r) that determines 
the three-metric of a spherically symmetric hypersurface. Prove that 

-— I (k-k )Vvd 2 6 = r(l- ^1 - 2ra/rY 
8tt J s{r) \ J 

where S(r) is a two-surface of constant r. Use this to show that ra(oo) is the 
ADM mass of this hypersurface. 

6. In this problem we explore some consequences of Eq. (4.3.9), which gives an 
expression for the ADM mass of a stationary spacetime. 

a) Prove that the right-hand side of Eq. (4.3.9) is independent of the choice 

of hypersurface S. 

b) Show that if T a/3 is the stress-energy tensor of a static perfect fluid, then 

Eq. (4.3.9) reduces to 

M = [ (p + 3p)e' !/ Vhd 3 y, 

where p is the mass density, p the pressure, and e 2 * = — 9ap£" t )£(t) - 
[Hints: A perfect fluid is static if its four- velocity u a is parallel to 
You may assume that the spacetime also is static] 

c) Specialize to spherical symmetry, and write the spacetime metric as 

ds 2 = -e 2 * dt 2 + (1 - 2m/r)" 1 dr 2 + r 2 dtf, 

in which $ and m are functions of r. Refer back to the result of Problem 
5 and deduce the identity 

f p(l-2m/r) 1/2 dV = [(p + 3p)e*dV, 

where dV = Vhd 3 y is the natural volume element on the hypersurface 
S. 
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d) Specialize now to a weak-field situation, for which the metric can be ex- 
pressed as 

ds 2 = -(1 + 2$) dt 2 + (1 - 2$)(cfe 2 + dy 2 + dz 2 ); 

the Newtonian potential $ is a function of f = ^Jx 1 +y 2 + z 2 . Work- 
ing consistently in the weak-field approximation, show that the identity 
derived in part c) reduces to 

— I \V$\ 2 dV = 3 f pdV, 

in which dV and all vectorial operations refer to the three-dimensional 
flat space of ordinary vector calculus. The left-hand side represents 
(minus) the total gravitational potential energy of the system. For a 
monoatomic ideal gas in thermodynamic equilibrium, the right-hand side 
represents twice the total kinetic energy of the system. This equation 
is therefore a formulation of the virial theorem of Newtonian gravita- 
tional physics. The identity of part c) can then be interpreted as the 
general-relativistic version of the virial theorem. 

7. The ADM mass is usually defined by 

M = -L I (D b lab - D a7 )r a V^d 2 6, 
16tt Js^" 

which is a very different expression from the one appearing in Sec. 4.3.1. Here, 
S is the two-surface that encloses the spacelike hypersurface S. If /i f, is the 
metric on £ in arbitrary coordinates y a , then 7 & = /i f, — /i° 6 , where h® b is the 
metric of flat space in the same coordinates. We also have 7 = 7° a , and D a 
is the covariant derivative associated with the flat metric h° b , which is used 
to raise and lower all indices. Finally, r° is the unit normal of the surface S, 
and s/atfiO is the surface element on S. 

The purpose of this problem is to prove that this definition is equivalent to 
the one given in the text, 

M = ~ I (k-k )^d 2 6, 

87T Js-f-oo 

where k (ko) is the extrinsic curvature of S embedded in S (flat space). You 
may proceed along the following lines: 

Because both expressions are invariant under a coordinate transformation, we 
may use, in a neighbourhood of S, the coordinates (£,6 A ), where I is proper 
distance off S in the direction orthogonal to S, and 6 A are the coordinates 
on S which are Lie transported off S along curves orthogonal to S. In these 
coordinates, the metric on S is given by 

h ab dy a dy b = dl 2 + ~o AB (l) d8 A d8 B , 

where &ab{£) (which also depends on 6 A ) is such that ^^(O) = gab, the 
induced metric on S. Similarly, 

h° ab dy a dy b = dl 2 + a%{i) d9 A d6 B . 

Because the induced metrics must agree on S, we also have g ab (0) = gab- 
This implies that 7 f, = on S. 
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Using this information, show that both expressions for M reduce to the same 
form, 

M = --L/ h 0ab lab ^d 2 6. 
16?r 

This is sufficient to prove that the two expressions are indeed equivalent. 

8. In this problem we study the transport of energy and angular momentum by 
a scalar field tp in flat spacetime. The metric is ds 2 = —dt 2 + dr 2 + r 2 d£l 2 , 
and the scalar field satisfies the wave equation g al3 ip-ap = — 4np, where p is 
a specified source. It can be shown that in the wave zone (where r is much 
larger than a typical wavelength of the radiation) , the field is given by 

1 oo I 

V(*,r,M) = -£ Y, ai m (u)Y im (e,<P) + 0(r- 2 ), 

i=0 m=-l 

where Yt m {Q,<$) are spherical harmonics; the amplitudes ai m are constructed 
from /?, and they are functions of retarded time u = t — r. The scalar field 
comes with a stress-energy tensor 

T a p = tp, a tp,p - ^{tp^tp^gap, 

and we are interested in the transfer of energy and angular momentum across 
a null hypersurface X defined by v = constant, where v = t + r is advanced 
time. 

a) Show that dT, a = —r 2 k a dud£l, where k a = —\d a v and dQ, = sin0 d9d<f>, 

is a surface element on S. 

b) Prove that for any test field producing a stress-energy tensor T a P, the 

amount of energy crossing S per unit retarded time is 

^ = I r 2 T a gk a tP dfl, 
du Js 

where t a = dx a /dt and 5 is a two-sphere of constant u and v. Prove 
also that the amount of angular momentum flowing across S is given by 

j r 2 T afi k a <\P dn, 



the preceding expressions reduce to 

CO i 

^2 \at m (u)\ 

1=0 m=-i 
I 

X! imil t-m{u)a* lm {u) 

m=-i 

in the limit v — > oo. Here, an overdot indicates differentiation with 
respect to u, and an asterisk denotes complex conjugation. 

d) Suppose that the source producing the scalar radiation is in rigid rotation 
around the z axis, in the sense that the t and cf> dependence of p resides 



dJ_ _ 
du 

where (f> a = dx a /d(f>. 
c) Show that for scalar radiation, 

dE _ 
du 

and 

dJ ^ 
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entirely in the combination </>—£lt, where fi is a constant angular velocity. 
Prove that in this situation, the field satisfies 

v>«r = o, 

where £ a = t" + £l<fi a . Prove also that in the limit v — > oo, the transfers 
of energy and angular momentum are related by 

— -n — 

du du ' 

This relation applies to any type of radiation emitted by a source in 
rigid rotation. It is valid also in curved spacetimes, provided that the 
spacetime is stationary, axially symmetric, and asymptotically flat. 



Chapter 5 
Black holes 



The final chapter of this book is devoted to one of the most successful applications 
of general relativity, the mathematical theory of black holes. In the first part of the 
chapter we explore three exact solutions to the Einstein field equations that describe 
black holes; those are the Schwarzschild (Sec. 5.1), Reissner-Nordstrom (Sec. 5.2), 
and Kerr (Sec. 5.3) solutions. In Sec. 5.4 we move away from the specifics of 
those solutions and consider properties of black holes that can be formulated quite 
generally, without relying on the details of a particular metric. In the final section of 
this chapter, Sec. 5.5, we present the four fundamental laws of black-hole mechanics. 

The most important feature of a black-hole spacetime is the event horizon, a null 
hypersurface which acts as a causal boundary between two regions of the spacetime, 
the interior and exterior of the black hole. Many physical quantities associated with 
the black hole, such as its mass, angular momentum, and surface area, are defined 
by integration over the event horizon. The integration techniques introduced in 
Chapter 3 will be put to direct use here, as well as the notions of mass and angular 
momentum encountered in Chapter 4. And since the event horizon is generated by 
a congruence of null geodesies, the methods introduced in Chapter 2 will also be 
part of our discussion. So here it all comes together in one final glorious moment! 



5.1 Schwarzschild black hole 

5.1.1 Birkhoff's theorem 

The Schwarzschild metric, 

9 / 2M\ 9 / 2M\~ 1 9 9 ^ 9 , 
ds 2 = -[l- — \dt 2 + J dr 2 +r 2 dn 2 , (5.1.1) 

is the unique solution to the Einstein field equations that describes the vacuum 
spacetime outside a spherically symmetric body of mass M. While this object 
could have a time-dependent mass distribution, the external spacetime is necessarily 
static, and its metric is given by Eq. (5.1.1). This statement, known as Birkhoff's 
theorem, implies that a spherical mass distribution cannot emit gravitational waves. 

The proof of the theorem goes as follows. The metric of a spherically symmetric 
spacetime can always be cast in the form 

ds 2 = -e^f dt 2 + J' 1 dr 2 + r 2 dU 2 , (5.1.2) 

involving the two arbitrary functions ip(t,r) and f(t,r). It is convenient to also 
introduce a mass function m(t,r) defined by 

/.l-S. (5.1.3) 
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For the metric of Eq. (5.1.2), the Einstein field equations are 
dr v th dt v l > 

(5.1.4) 

^ = 4 OT /- 1 (-n + T;). 

The first two equations motivate the name "mass" for the function m(i,r), as — T\ 
represents the density of mass-energy and — T r t its outward radial flux; they imply 
that in vacuum, m(i, r) = M, a constant. The third gives tp' = 0, and tp(t, r) can be 
set equal to zero without loss of generality. The Schwarzschild solution is thereby 
recovered. 



5.1.2 Kruskal coordinates 

The difficulties of the Schwarzschild metric at r = 2M are well known. While the 
spacetime is perfectly well behaved there, the coordinates (t, r) become singular at 
r = 2M — they are no longer in a one-to-one correspondence with spacetime events. 
This problem can be circumvented by introducing another coordinate system. The 
following construction originates from the independent work of Kruskal (1960) and 
Szekeres (1960). 

Consider a swarm of massless particles moving radially in the Schwarzschild 
spacetime — t and r vary, but not 6 and <j>. It is easy to check that ingoing 
particles move along curves v = constant, while outgoing particles move along 
curves u = constant, where 

u = t — r*, v =t + r*, 

(5.1.5) 

r 



h 



dr 



2M/r 



r + 2Mln 



2M 1 



In a spacetime diagram using v (advanced time) and u (retarded time) as oblique 
coordinates (both oriented at 45 degrees), the massless particles propagate at 45 
degrees, just as in flat spacetime (Fig. 5.1). The null coordinates (u, v) are therefore 
well suited to the description of (radial) null geodesies. In these coordinates, the 
Schwarzschild metric takes the form 

ds 2 = -(l-2M/r)dudv + r 2 dtt 2 . (5.1.6) 

Here, r appears no longer as a coordinate, but as the function of u and v defined 
implicitly by r*(r) = |(i> — u). In these coordinates, the surface r = 2M appears 
at v — u = — oo, and it is still the locus of a coordinate singularity. 

To see how this coordinate singularity might be eliminated, we focus our at- 
tention on a small neighbourhood of the surface r = 2M, in which the rela- 
tion r*(r) can be approximated by r* ~ 2M\n\r/2M — 1|. This implies that 
r/2M ~ 1 ± e T '' 2M = 1 ± e ("-«)/ 4M , and / ~ ± e (*-«)/ 4M . Here and below, the 
upper sign refers to the part of the neighbourhood corresponding to r > 2M, while 
the lower sign refers to r < 2M. The metric (5.1.6) becomes 



This expression motivates the introduction of a new set of null coordinates, U and 
V, defined by 

C/ = Te -"/4M j y = e „/4M_ (5L7) 
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outgoing ray 



Figure 5.1: Spacetime diagram based on the (u,v) coordinates. 

It is now clear that when expressed in terms of these coordinates, the metric will 
be well behaved near r = 2M. Going back to the exact expression (5.1.5) for r*, 
we have that e r * l' 2M = e (*-«)/ 4M = ^UV, or 

eTl2M {ii- l ) = - uv ' ^ 

which implicitly gives r as a function of UV. You may check that the Schwarzschild 
metric is now given by 

ds 2 = _WMS_ e -r/2M dud y + r 2 dn 2 (g j g) 

r 

This is manifestly regular at r = 2M. The coordinates U and V are called null 
Kruskal coordinates. In a Kruskal diagram (a map of the U-V plane; Fig. 5.2), 
outgoing light rays move along curves U = constant, while ingoing light rays move 
along curves V = constant. 

In the Kruskal coordinates, a surface of constant r is described by an equa- 
tion of the form UV = constant, which corresponds to a two-branch hyperbola in 
the U-V plane. For example, r = 2M becomes UV = 0, while r = becomes 
UV = 1. There are two copies of each surface r = constant in a Kruskal diagram. 
For example, r = 2M can be either U = or V = 0. The Kruskal coordinates 
therefore reveal the existence of a much larger manifold than the portion covered 




Figure 5.2: Kruskal diagram. 
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by the original Schwarzschild coordinates. In a Kruskal diagram, this portion is 
labeled I. The Kruskal coordinates do not only allow the continuation of the metric 
through r = 2M into region II, they also allow continuation into regions III and 
IV. These additional regions, however, exist only in the maximal extension of the 
Schwarzschild spacetime. If the black hole is the result of gravitational collapse, 
then the Kruskal diagram must be cut off at a timelike boundary representing the 
surface of the collapsing object. Regions III and IV then effectively disappear 
below the surface of the collapsing star. 

5.1.3 Eddington-Finkelstein coordinates 

Because of the implicit nature of the relation between r and UV, the Kruskal 
coordinates can be awkward to use in some computations. In fact, it is rarely 
necessary to employ coordinates that cover all four regions of the Kruskal diagram, 
although it is often desirable to have coordinates that are well behaved at r = 2M. 
In such situations, choosing v and r as coordinates, or u and r, does the trick. 
These coordinate systems are called ingoing and outgoing Eddington-Finkelstein 
coordinates, respectively. 

It is easy to check that in the ingoing coordinates, the Schwarzschild metric 
takes the form 

ds 2 = -(1 - 2M/r)dv 2 +2dvdr + r 2 dtt 2 , (5.1.10) 
while in the outgoing coordinates, 

ds 2 = -(1 - 2M/r)du 2 - 2 dudr + r 2 dtt 2 . (5.1.11) 

It may also be verified that the (v,r) coordinates cover regions I and II of the 
Kruskal diagram, while u and r cover regions IV and I. 

The Eddington-Finkelstein coordinates can also be used to construct spacetime 
diagrams (Fig. 5.3), but these do not have the property that both ingoing and 
outgoing null geodesies propagate at 45 degrees: While ingoing light rays move 
with dv = 0, that is, along coordinate lines that can be oriented at 45 degrees, the 
outgoing rays move with dv/dr = 2/(1 — 2M/r), that is, with a varying slope. 

5.1.4 Painleve-Gullstrand coordinates 

Another useful set of coordinates for the Schwarzschild spacetime are the Painleve- 
Gullstrand coordinates first considered in Sec. 3.13, Problem 1. Here, as with the 
Eddington-Finkelstein coordinates, the spatial coordinates (r, 6, <f>) are the same as 



1 



% 

" = 



r = 2M 



outgoing 



ingoing 



Figure 5.3: Spacetime diagram based on the (v,r) coordinates. 
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in the original form of the metric, Eq. (5.1.1), but the time coordinate is different: 
T is proper time as measured by a free-falling observer starting from rest at infinity 
and moving radially inward. 

The four- velocity of such an observer is given by u a d a = / _1 dt — \/l — / d r , 
where / = 1 — 2M/r. From this we deduce that u a = —d a T, where the time function 
T is obtained by integrating dT = dt + f~ x ^l — J dr = dr, where r is proper time. 
(Integration is elementary, and the result appears in Sec. 3.13, Problem 1.) After 
inserting this expression for dt into Eq. (5.1.1), we obtain the Painleve-Gullstrand 
form of the Schwarzschild metric: 

ds 2 = -dT 2 + (dr + ^2M/rdT) 2 + r 2 dfi. (5.1.12) 

The coordinates (T,r,6,(f>) give rise to a metric that is regular at r = 2M, in 
correspondence with the fact that our free-falling observer does not consider this 
surface to be in any way special. Because this observer originates in region I of the 
spacetime (at r = oo) and ends up in region II (at r = 0), the new coordinates 
cover only these two regions of the Kruskal diagram. By reversing the motion — 
letting dr become — dr in Eq. (5.1.12) — an alternative coordinate system can be 
produced that covers regions VI and I instead. 

From Eq. (5.1.12) we infer a rather striking property of the Painleve-Gullstrand 
coordinates: the hypersurfaces T = constant are all intrinsically flat. This can be 
seen directly from the fact that the induced metric on any such hypersurface is 
given by ds 2 = dr 2 + r 2 dfl. 

5.1.5 Penrose-Carter diagram 

The double-null Kruskal coordinates make the causal structure of the Schwarzschild 
spacetime very clear, and this is their main advantage. Another useful set of double- 
null coordinates is obtained by applying the transformation 

f/ = arctan[/, V = arctanV: (5.1.13) 

This rescaling of the null coordinates does not affect the appearance of radial light 
rays, which still propagate at 45 degrees in a spacetime diagram based on the new 
coordinates (Fig. 5.4). However, while the range of the initial coordinates was 
infinite (for example, — oo < U < oo), it is finite for the new coordinates (for 
example, — tt/2 < U < it/2). The entire spacetime is therefore mapped into a finite 
domain of the U-V plane. This compactification of the manifold introduces bad 
coordinate singularities at the boundaries of the new coordinate system, but these 




Figure 5.4: Compactified coordinates for the Schwarzschild spacetime. 
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Figure 5.5: Penrose-Carter diagram of the Schwarzschild spacetime. 



are of no concern when the purpose is simply to construct a compact map of the 
entire spacetime. 

In the new coordinates, the surfaces r = 2M are located at U = and V = 0, 
and the singularities at r = 0, or UV = 1, are now at U + V = ±n/2. The 
spacetime is also bounded by the surfaces U = ±n/2 and V = ±ir/2. The four 
points (U,V) = (±§, ±f ) are singularities of the coordinate transformation: In the 
actual spacetime, the surfaces U = 0, U = oo, and UV = 1 never meet. 

It is useful to give names to the various boundaries of the compactified spacetime 
(Fig. 5.5). The surfaces U = n/2 and V = n/2 are called future null infinity, and are 
labeled J? + (pronounced "scri plus" ) . The diagram makes it clear that J? + contains 
the future endpoints of all outgoing null geodesies (those along which r increases). 
Similarly, the surfaces U = —n/2 and V = —iv/2 are called past null infinity, and 
are labeled J?~. These contain the past endpoints of all ingoing null geodesies 
(those along which r decreases). The points at which J? + and meet are called 
spacelike infinity, and are labeled i°. These contain the endpoints of all spacelike 
geodesies. The points (U , V) = (0, f ) and (U, V) = (f , 0) are called future timelike 
infinity, and are labeled i + . These contain the future endpoints of all timelike 
geodesies that do not terminate at r = 0. Finally, the points (U, V) = (0, — f ) and 
(U, V) = (— f , 0) are called past timelike infinity, and are labeled i~ . These contain 
the past endpoints of all timelike geodesies that do not originate at r = 0. Table 
5.1 provides a summary of these definitions. 

Compactified maps such as the one displayed in Fig. 5.5 are called Penrose- 
Carter diagrams. They display, at a glance, the complete causal structure of the 
spacetime under consideration. They make a very useful tool in general relativity. 



5.1.6 Event horizon 

On a Kruskal diagram (Fig. 5.2), all radial light rays move along curves U = 
constant or V = constant. The light cones are therefore oriented at 45 degrees, 



Table 5.1: Boundaries of the compactified Schwarzschild spacetime. 

Label Name Definition 

J? + Future null infinity v = oo, u finite 

Past null infinity u = — oo, v finite 

i° Spatial infinity r = oo, t finite 

i + Future timelike infinity t = oo, r finite 

i~ Past timelike infinity t = — oo, r finite 
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and timelike world lines, which lie within the light cones, move with a slope larger 
than unity. The one-way character of the surface r = 2M separating regions I and 
II of the Schwarzschild spacetime is then quite clear: An observer crossing this sur- 
face can never retrace her steps, and cannot elude an encounter with the curvature 
singularity at r = 0. It is also clear that after crossing r = 2M, the observer can 
no longer send signals to the outside world, although she may continue to receive 
them. The surface r = 2M therefore prevents any external observer from detecting 
what goes on inside. In this context, it is called the black-hole's event horizon. The 
region within the event horizon (region II) is called the black-hole region of the 
Schwarzschild spacetime. 

The surface r = 2M that separates regions I and II must be distinguished from 
the surface r = 2M that separates regions IV and I. It is clear that the latter 
is an event horizon to all observers living inside region IV (who cannot perceive 
what goes on in region I). It is also a one-way surface, because observers from the 
outside cannot cross it. To distinguish between the two surfaces r = 2M, it is usual 
to refer to the first as a future horizon, and to the second as a past horizon. The 
region within the past horizon (region IV) is called the white-hole region of the 
Schwarzschild spacetime. 



5.1.7 Apparent horizon 

Another important property of the surface r = 2M has to do with the behaviour 
of outgoing light rays in a neighbourhood of this surface. Here, the term outgoing 
will refer specifically to those rays which move on curves U = constant. This is 
potentially confusing, because the radial coordinate r does not necessarily increase 
along those rays; in fact, r increases only if U < (outside the black hole), and 
it decreases if U > (inside the black hole). While the term "outgoing" should 
perhaps be reserved to designate rays along which r always increases, this choice of 
terminology is fairly standard. Similarly, we will use the term ingoing to designate 
light rays which move on curves V = constant. If V > 0, then r decreases along 
the ingoing rays; if V < 0, r increases. 

We will show that the expansion of a congruence of outgoing light rays (as 
defined above) changes sign at r = 2M. (This should be obvious just from the fact 
that r increases along the geodesies that are outside r = 2M, but decreases along 
geodesies that are inside.) Outgoing light rays have 

k a = -d a U (5.1.14) 

as their (affinely parameterized) tangent vector, and their expansion is calculated 
as 6 = k a . a = \g\~ 1/2 (\g\ 1/2 k a ) ta . In Kruskal coordinates, k v = \guv\~ 1 is the 
only nonvanishing component of k a , and \g\ x ^ 2 = \guv |?* 2 sin 2 6. This gives 6 = 
2r y /r\guv\, and using Eq. (5.1.8) and (5.1.9), we obtain 

e = k a a = --^-. (5.i.i5) 

2Mr v ' 

As was previously claimed, the expansion is positive for U < (in the past of 
r = 2M) and negative for U > (in the future of r = 2M). The expansion 
therefore changes sign at r = 2M, and in this context, this surface is called an 
apparent horizon. (A similar calculation would reveal that for ingoing light rays, 
the expansion is negative everywhere in regions I and II.) 

To give a proper definition to the term "apparent horizon" , we must first intro- 
duce the notion of a trapped surface (Fig. 5.6). Let E be a spacelike hypersurface. 
A trapped surface on E is a closed, two-dimensional surface S such that for both 
congruences (ingoing and outgoing) of future-directed null geodesies orthogonal to 
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Figure 5.6: Trapped surfaces and apparent horizon of a spacelike hypersurface. 



5, the expansion 6 is negative everywhere on S. (It should be clear that each two- 
sphere U, V = constant in region II of the Kruskal diagram is a trapped surface.) 
Let & be the part of £ that contains trapped surfaces; this is known as the trapped 
region of S. The boundary of the trapped region, 83", is what is defined to be 
the apparent horizon of the spacelike hypersurface £. (In Schwarzschild spacetime, 
this would be any two-sphere at r = 2M.) Notice that the apparent horizon is 
a marginally trapped surface: For one congruence of null geodesies orthogonal to 
<9iT, 6 = 0. Notice also that the apparent horizon designates a specific two-surface 
S on a given hypersurface X. The apparent horizon can generally be extended to- 
ward the future (and past) of £, because hypersurfaces to the future (and past) of 
£ also contain apparent horizons. The union of all these apparent horizons forms 
a three-dimensional surface si called the trapping horizon of the spacetime. (In 
Schwarzschild spacetime, this would be the entire hypersurface r = 2M.) In the 
following we will not distinguish between the two-dimensional apparent horizon 
and the three-dimensional trapping horizon; we will refer to both as the apparent 
horizon. (This sloppiness of language is fairly standard.) 



5.1.8 Distinction between event and apparent horizons: Vaidya 

spacetime 

The event and apparent horizons of the Schwarzschild spacetime coincide, and it 
may not be clear why the two concepts need to be distinguished. This coinci- 
dence, however, is a consequence of the fact that the spacetime is stationary; for 
more general black-hole spacetimes, the event and apparent horizons are distinct 
hypersurfaces. To illustrate this we introduce a simple, non-stationary black-hole 
spacetime. 

We express the Schwarzschild metric in terms of ingoing Eddington-Finkelstein 
coordinates, 

ds 2 = -fdv 2 +2dvdr + r 2 dU 2 , (5.1.16) 
and we allow the mass function to depend on advanced time v. 

/ = 1-^M. (5.1.17) 

r 

This gives the ingoing Vaidya metric, a solution to the Einstein field equations with 
stress-energy tensor 

T «/3 = ^?W/3> (5.1.18) 

where l a = —d a v is tangent to ingoing null geodesies. This stress-energy tensor 
describes null dust, a pressureless fluid with energy density (dm / dv) / (4nr 2 ) and 
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four-velocity l a . (A similar, outgoing Vaidya solution was considered in Sec. 4.3.5. 
A notable difference between these solutions is that here, the mass function must 
increase for T a p to satisfy the standard energy conditions.) 

Consider the following situation. A black hole, initially of mass mi, is irradiated 
(with ingoing null dust) during a finite interval of advanced time (between v± and 
V2 ) so that its mass increases to m 2 . Such a spacetime is described by the Vaidya 
metric, with a mass function given by 

{mi v < Vi 

™>12(v) Vi < v < v 2 , 

m 2 v > v 2 

where mi2{v) increases smoothly from mi to m 2 . We would like to determine the 
physical significance of the surfaces r = 2mi , r = 2mi 2 (v) , and r = 2m 2 , and find 
the precise location of the event horizon. 

It should be clear that r = 2mi and r = 2m 2 describe the apparent horizon when 
v < Vi and v > w 2 , respectively. More generally, we will show that the apparent 
horizon of the Vaidya spacetime is always located at r = 2m (v). 

The null vector field k a dx a = —fdv + 2dr is tangent to a congruence of out- 
going null geodesies. It does not, however, satisfy the geodesic equation in affine- 
parameter form: As a brief calculation reveals, k a .pkP = k k a , where k = 2m(v)/r 2 . 
To calculate the expansion of the outgoing null geodesies, we need to introduce an 
affine parameter A* and a rescaled tangent vector k" = dx a /d\*. (The calculation 
can also be handled via the results of Sec. 2.6, Problem 8.) As was shown in Sec. 1.3, 
the desired relation between these vectors is k" = e~ r k a , where dF/dX = k(A), with 
A denoting the original parameter. We have 



k a 

— 




= 


-r, a k a ) 




dT\ 




dXj 


= e - r (*v 


-")■ 



Here, the factor k a . a — K is the congruence's expansion when measured in terms of the 
initial parameter A — it is equal to (6A)~ 1 d(6A)/dX, where 6 A is the congruence's 
cross-sectional area. The factor e~ r converts it to (SA)~ 1 d(SA)/dX* , and this oper- 
ation does not affect the sign of 6. A simple computation gives k a . a = 2(r — m)/r 2 , 
and we arrive at 

e T e=l[r-2m(v)]. 

So 6 = on the surface r = 2m(u), and we conclude that the apparent horizon 
begins at r = 2m\ for v < vi, follows r = 2mi 2 (t;) in the interval Vi < v < i> 2 , and 
remains at r = 2m 2 for v > i> 2 . 

We may now show that while the apparent horizon is a null hypersurface before 
v = vi and after v = u 2 , it is spacelike in the interval v± < v < u 2 . This follows at 
once from the fact that if $ = r — 2m{v) = describes the apparent horizon, then 

«/3$ $ R - -4^1 
9 9, a 9,p i ^ 

is negative (so that the normal a is timelike) if dm/dv > (so that the energy 
conditions are satisfied). We therefore see that the apparent horizon is null when 
the spacetime is stationary, but that it is spacelike otherwise. 

Where is the event horizon? Clearly, it must coincide with the surface r = 2m 2 
in the future of v = i> 2 . But what is its extension to the past of v = ii 2 ? Because 



134 



Black holes 



r = r = 2mi r = 2mi 




Figure 5.7: Black hole irradiated with ingoing null dust. 



the event horizon is defined as a causal boundary in spacetime, it must be a null 
hypersurface generated by null geodesies (more will be said on this in Sec. 5.4). The 
event horizon can therefore not coincide with the apparent horizon in the past of 
v = i>2- Instead, its location is determined by finding the outgoing null geodesies 
of the Vaidya spacetime that connect smoothly with the generators of the surface 
r = 2rri2- (See Fig. 5.7; a particular example is worked out in Sec. 5.7, Problem 2.) 

It is clear that the generators of the event horizon have to be expanding in the 
past of v = V2 if they are to be stationary (in the sense that 6 = 0) in the future. 
Indeed, supposing that the null energy condition is satisfied (which will be true if 
dm/dv > 0), the focusing theorem (Sec. 2.4) implies that the congruence formed by 
the null generators of the event horizon will be focused by the infalling null dust; a 
zero expansion in the future of v = i>2 guarantees a positive expansion in the past. 
The event horizon is therefore generated by those null geodesies that undergo just 
the right amount of focusing, so that after encountering the last of the infalling 
matter, their expansion goes to zero. 

The event horizon coincides with the apparent horizon only in the future of 
v = i)2- In the past, because the apparent horizon has a spacelike segment while the 
event horizon is everywhere null, the apparent horizon lies within the event horizon, 
that is, inside the black hole (Fig. 5.7). As we shall see in Sec. 5.4, this observation 
is quite general. 

It is a remarkable property of the event horizon that the entire future history of 
the spacetime must be known before its position can be determined: The black hole's 
final state must be known before the horizon's null generators can be identified. 
This teleological property is not shared by the apparent horizon, whose location at 
any given time (as represented by a spacelike hypersurface) depends only on the 
properties of the spacetime at that time. 



5.1.9 Killing horizon 

The vector t a = dx a /dt is a Killing vector of the Schwarzschild spacetime. While 
this vector is timelike outside the black hole, it is null on the event horizon, and it 
is spacelike inside: 



The surface r = 2M can therefore be called a Killing horizon, a hypersurface on 
which the norm of a Killing vector goes to zero. In static black- hole spacetimes, the 
event, apparent, and Killing horizons all coincide. 
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5.1.10 Bifurcation two-sphere 

The point (U, V) = (0, 0) in a Kruskal diagram, at which the past and future hori- 
zons intersect, represents the bifurcation two-sphere of the Schwarzschild spacetime. 
This two-surface is characterized by the fact that the Killing vector t a = dx a /dt 
vanishes there. To recognize this, we need to work out the components of this 
vector in Kruskal coordinates. From Eqs. (5.1.5) and (5.1.7) we get the relation 
e t/2M _ _v/U, and after using Eq. (5.1.8), we obtain 

TJ2 _ >-t)/2M ( _!_ _ A yl _ (r+t)/2M ( J_ _ A 

\2M y V - \2M J' 

Taking partial derivatives with respect to t, we arrive at 

t v = ~, t v = ^-. (5.1.19) 
AM AM v ' 

It follows immediately that t a = at the bifurcation two-sphere. It should be noted 
that the bifurcation two-sphere exists only in the maximally extended Schwarzschild 
spacetime. If the black hole is the result of gravitational collapse, then the bifurca- 
tion two-sphere is not part of the actual spacetime. 

According to our previous calculation, t v is the only nonvanishing component 
of the Killing vector on the future horizon. This implies that t a oc —d a U at U = 0, 
and we have the important result that t a is tangent to the null generators of the 
event horizon. This was to be expected from the fact that the event horizon of the 
Schwarzschild spacetime is also a Killing horizon. 



5.2 Reissner-Nordstrom black hole 

5.2.1 Derivation of the Reissner-Nordstrom solution 

The Reissner-Nordstrom (RN) metric describes a static, spherically symmetric black 
hole of mass M possessing an electric charge Q. We begin our discussion with a 
derivation of this solution to the Einstein-Maxwell equations. 

We assume that the electromagnetic-field tensor F a @ has no components along 
the 6 and <f> directions; this ensures that the field is purely electric when measured 
by stationary observers. Under this assumption, the only nonvanishing component 
is F tr . Maxwell's equations in vacuum are = -F"^ = |5|~ 1 ^ 2 (|5| 1 ^ 2 -F a/3 ),/3- Using 
the metric of Eq. (5.1.2), this implies (e^r 2 F tr )' = 0, or 

ptr _ e ~^Q, 

where Q is a constant of integration, to be interpreted as the black-hole charge. 
The stress-energy tensor for the electromagnetic field is 



and a few steps of algebra yield 

T^ = ^diag(-1, -1,1,1). (5.2.1) 

The Einstein field equations (5.1.4) imply m! = Q 2 /2r 2 , or m(r) = M - Q 2 /2r. 
The fact that T\ = T r r implies tp' = 0, so that tp can be set to zero without loss of 
generality. The RN solution is therefore 

ds2 = _ ( ! _ + 91) d f + ( , _ 2M + - ^ + r2 ^ (5 _ 2 _ 2) 
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with an electromagnetic-field tensor whose only nonvanishing component is 

F tT = %■ (5-2-3) 

Here, M is total (ADM) mass of the spacetime, and Q is the black hole's electric 
charge. 

To see that Q is indeed the charge, consider a nonsingular charge distribution 
on a spacelike hypersurface S, described by a current density j a . An appropriate 
definition for total charge is Q = J s j a dH a , or Q = (47r) _1 J" e F a ^.p dT, a after using 
Maxwell's equations. Using Stokes' theorem (Sec. 3.3.3), we rewrite this as an 
integral over a closed two-surface S bounding the charge distribution. This yields 
Gauss' law, 

Q = i- £f<*P dS af} . (5.2.4) 

The advantage of this expression for the total charge is that it is applicable even 
when the charge distribution is singular, which is the case in the present application. 
Also, this definition of total charge is in the same spirit as the previously encountered 
definitions for total mass and angular momentum (Sec. 4.3). Substituting Eq. (5.2.3) 
and evaluating for a two-sphere of constant t and r confirms that the Q appearing 
in Eq. (5.2.3) is indeed the black hole's total charge. 



5.2.2 Kruskal coordinates 
The function f(r) = 1 — 2M/r + Q 2 /r 2 has zeroes at r = r±, where 

r±=M± y/M 2 -Q 2 . (5.2.5) 

The roots are both real, and the RN spacetime truly contains a black hole, if 
|<3| < M. The special case of a black hole with \Q\ = M is referred to as an extreme 
RN black hole. If \Q\ > M, then the RN solution describes a naked singularity at 
r = 0. 

As for the Schwarzschild metric, the coordinates (t, r) are singular at the outer 
horizon (r = r + ), and new coordinates must be introduced to extend the metric 
across this surface. This can be done with the help of Kruskal coordinates. As we 
shall see, however, these coordinates fail to be regular at the inner horizon (r = r_): 
Another coordinate transformation is required to extend the metric beyond this 
surface. Thus, Kruskal coordinates are specific to a given horizon, and a single 
coordinate patch is not sufficient to cover the entire RN manifold (Fig. 5.8). As we 
shall see below, the outer horizon in an event horizon for the RN spacetime, and 
the inner horizon is an apparent horizon. 

Let us first take care of the extension across the outer horizon. We express the 
RN metric in the form 

ds 2 = -fdt 2 +f- 1 dr 2 +r 2 dn 2 , 
where / = 1 — 2M/r + Q 2 /r 2 . Near r = r + this function can be approximated by 

f(r) ~ 2ft+(r-r+), 
where k + = |/'(r + ). It follows that near r = r + , 

r * = /y^ ln l K+(r - r+) l- 

Introducing the null coordinates u = t — r* and v = t + r* , the surface r = r+ 
appears at v — u = — oo, and we define the Kruskal coordinates U+ and V+ by 

U+ = Te~ K+u , V+ = e K + v . (5.2.6) 



5.2 Reissner-Nordstrom black hole 



137 




Figure 5.8: Kruskal patches for the Reisser- Nordstrom spacetime. 



Here, the upper sign refers to r > r + and the lower sign refers to r < r+. It is easy 
to check that / ~ —2U + V + near r = r + , so that the metric becomes 

ds 2 ~ \ dU+dV+ + r+ 2 dO 2 . 

K + 

This shows that when expressed in the coordinates (U+,V+), the metric is well 
behaved at the outer horizon. On the other hand, an exact integration for r*(r) 
would reveal that r* — ► +oo at the inner horizon, which is then located at v—u = oo, 
or U+V+ = oo; the Kruskal coordinates are singular at the inner horizon. 

The coordinates (U+, V+) should be used only in the interval r± < r < oo, where 
7*i > r_ is some cutoff radius. Inside r = r±, another coordinate system must be 
introduced. One such system is (t,r), in which the metric takes the standard form 
of Eq. (5.2.2). It is important to understand that this new coordinate patch, which 
covers the portion of the RN spacetime corresponding to the interval r_ < r < r + , 
is distinct from the original patch covering the region r > r + . And indeed, because 
/ is now negative, the new t must be interpreted as a spacelike coordinate (because 
9tt > 0), while r must be interpreted as a timelike coordinate (because g rr < 0). 

There still remains the issue of extending the spacetime beyond r = r_ , where 
the new (t, r) coordinates fail. We want to construct a new set of Kruskal coor- 
dinates, U- and V_, adapted to the inner horizon. Retracing the same steps as 
before, we have that near r = r_, the function / can be approximated by 

f(r) ~ -2«_(r - r_), 

where k_ = ||/'(r_)|. It follows that 

r* ~ — - — lnjre_(r — r_)|. 
2k_ 1 1 

With u = t — r* and v = t + r* , the surface r = r_ appears at v — u = +oo, and we 
define the new Kruskal coordinates by 

v _ = =Fe "-« i y_ = _ e -"-». (5.2.7) 

Here, the upper sign refers to r > r_ and the lower sign refers to r < r_. Then 
/ ~ — 2U-V- and the metric becomes 

ds 2 ~ %r dU-dV- + rJ dtt 2 . 

This is manifestly regular across r = r_. The new Kruskal coordinates, however, 
are singular at r = r+. 

What happens now on the other side of the inner horizon? The most noticeable 
feature is that the singularity at r = appears as a timelike surface — this is 
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markedly different from what happens inside a Schwarzschild black hole, where the 
singularity is spacelike. Because / > when r < r_, r re-acquires its interpretation 
as a spacelike coordinate; any surface r = constant < r_ is therefore a timelike 
hypersurface, and this includes the singularity. Because it is timelike, the singularity 
can be avoided by observers moving within the black hole. This is a striking new 
phenomenon, and we should think about this very carefully. 

Consider the motion of a typical observer inside a RN black hole (Fig. 5.8). 
Before crossing the inner horizon (but after going across the outer horizon), r is a 
timelike coordinate and the motion necessarily proceeds with r decreasing. After 
crossing r = r_, however, r becomes spacelike, and both types of motion (r decreas- 
ing or increasing) become possible. Our observer may therefore decide to reverse 
course, and if she does, she will avoid r = altogether. Her motion inside the 
inner horizon will then proceed with r increasing, and she will cross, once more, the 
surface r = r_. This, however, is another copy of the inner horizon, distinct from 
the one encountered previously. (Recall that there are two copies of each surface 
r = constant in a Kruskal diagram.) After entering this new r > r_ region, our 
observer notices that r has once again become timelike, and finds that reversing 
course is no longer possible: her motion must proceed with r increasing, and this 
brings her in the vicinity of another surface r = r + . Because there is no reason 
for spacetime to just stop there, yet another Kruskal patch (U+, V+) must be intro- 
duced to extend the RN metric beyond this horizon. The new Kruskal coordinates 
take over where the old patch (f/_,V_) leaves off, at the spacelike hypersurface 
r = r±. 

The ultimate conclusion to these considerations is that our observer eventually 
emerges out of the black hole, through another copy of the outer horizon, into a new 
asymptotically-flat universe. Her trip may not end there: Our observer could now 
decide to enter the RN black hole that resides in this new universe, and this entire 
cycle would repeat! It therefore appears that the RN metric describes more than 
just a single black hole. Indeed, it describes an infinite lattice of asymptotically-flat 
universes connected by black-hole tunnels. 

Such a fantastic spacetime structure is best represented with a Penrose-Carter 
diagram (Fig. 5.9). This diagram makes it clear that region bounded by the surfaces 




Figure 5.9: Penrose-Carter diagram of the Reisser-Nordstrom spacetime. 
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r = r + and r = r_ contains trapped surfaces: both ingoing and outgoing light rays 
originating from this region converge toward the singularity. The outer and inner 
horizons are therefore apparent horizons, but only the outer horizon is an event 
horizon. 

5.2.3 Radial observers in Reissner-Nordstrom spacetime. 

The discovery of black-hole tunnels is so bizarre that it should be backed up by a 
solid calculation. Here we consider the geodesic motion of a free-falling observer in 
the RN spacetime. It is assumed that the motion proceeds entirely in the radial 
direction, and that initially, it is directed inward. 

We will first work with the (v, r) coordinates, in which the metric takes the form 

ds 2 = -fdv 2 +2dvdr + r 2 dti 2 , (5.2.8) 

where / = 1 — 2M/r + Q 2 /r 2 . The observer's four-velocity is u a d a = vd v +rd r , 
where an overdot denotes differentiation with respect to proper time r. The quantity 
E = —u a t a = —u v , the observer's energy per unit mass, is a constant of the motion. 
In terms of v and f, this is given by E = fv—r. On the other hand, the normalization 
condition u a u a = — 1 gives fv 2 — 2vr = 1, and these equations imply 

h n = -{E 2 -ff'\ v in = *- {& - f)1/ \ (5.2.9) 

where the sign in front of the square root was chosen appropriately for an ingoing 
observer. 

The equation for r can also be written in the form 

r 2 +f = E 2 , (5.2.10) 

which comes with a nice interpretation as an energy equation (Fig. 5.10). Its mes- 
sage is clear: After crossing the outer and inner horizons, the observer reaches a 
turning point (f = 0) at a radius r m i n < r_ such that /(r m j n ) = E 2 . The motion, 
which initially was inward, turns outward, and the observer eventually emerges out 
of the black hole, into a new external universe. During the outward portion of the 
motion, the observer's four-velocity is given by 

r out = + (E 2 -f) 1/2 , v out = E+{E2 ~ f)1/2 , (5.2.11) 
with the opposite sign in front of the square root. 




r = r = r_ r = r + 



Figure 5.10: Effective potential for radial motion in Reisser-Nordstrom spacetime. 
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Let us examine the behaviour of v as the observer traverses a horizon. When 
the motion is inward, we have that v ~ (2E)~ X in the limit / — > 0. This means 
that v stays finite during the first crossings of the outer and inner horizons. When 
the motion is outward, v ~ 2E/f in the limit / — > 0, and this means that the 
coordinates (v,r) become singular during the second crossing of the inner horizon. 
The observer's motion cannot be followed beyond this point, unless new coordinates 
are introduced. 

Let us therefore switch to the coordinates (u,r), in which the RN metric takes 
the form 

ds 2 = -fdu 2 -2dudr + r 2 dtt 2 . (5.2.12) 

In these coordinates, and during the outward portion of the motion (after the bounce 
at r = r m j n ), the four- velocity is given by 

r out = +(E 2 -/) 1/2 , u ou t = E ~ {E2 ~ f)1/2 ■ (5.2.13) 

We have that ii ~ (2.E) -1 when / — > 0, which shows that u stays finite during the 
second crossings of the inner and outer horizons. 

We see that a large portion of the RN spacetime is covered by the two coordinate 
patches employed here (Fig. 5.11). This includes two asymptotically-flat regions 
connected by a black-hole tunnel that contains two copies of the outer horizon, and 
two copies of the inner horizon. The complete spacetime is obtained by tessellation, 
using the patches (v, r) and (u, r) as tiles; this gives rise to the diagram of Fig. 5.9. 
Because the completed spacetime contains an infinite number of black-hole tunnels, 
an infinite number of coordinate patches is required for its description. 

The presence of black-hole tunnels in the RN spacetime is now well established. 
These tunnels, of course, have a lot to do with the occurrence of a turning point in 
the motion of our free-falling observer. This is a rather striking feature of the RN 
spacetime. While turning points are a familiar feature of Newtonian mechanics, in 
this context they are always associated with the presence of an angular-momentum 
term in the effective potential: the centrifugal force is repulsive, and it prevents an 
observer from ever reaching the centre at r = 0. This, however, cannot explain what 
is happening here, because the motion was restricted from the start to be purely 
radial — there is no angular momentum present to produce a repulsive force. The 
gravitational field alone must be responsible for the repulsion, and we are forced to 
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conclude that inside the inner horizon, the gravitational force becomes repulsive! It 
is this repulsive gravity that ultimately is responsible for the black-hole tunnels. 

Such a surprising conclusion can perhaps be understood better if we recall our 
previous expression for the mass function: 

O 2 

m(r) = M - (5.2.14) 

This relation shows that m(r) becomes negative if r is sufficiently small, and clearly, 
this negative mass will produce a repulsive gravitational force. How can we explain 
this behaviour for the mass function? We recall that m(r) represents the mass inside 
a sphere of radius r. In general, this will be smaller than the total mass M = m(oo), 
because a sphere of finite radius r excludes a certain amount — equal to Q 2 /(2r) 
— of electrostatic energy. If the radius is sufficiently small, then Q 2 /(2r) > M, and 
m(r) < 0. You may check that this always occurs within the inner horizon. 

The conclusion that the RN spacetime contains black-hole tunnels is firm. Should 
we then feel confident that a trip inside a charged black hole will lead us to a new 
universe? This answer is no. The reason is that the existence of such tunnels de- 
pends very sensitively on the assumed symmetries of the RN spacetime, namely, 
staticity and spherical symmetry. These symmetries would not be exact in a realis- 
tic black hole, and slight perturbations have a dramatic effect on the hole's internal 
structure. The tunnels are therefore unstable, and they do not appear in realistic 
situations. (More will be said on this in Sec. 5.7, Problem 3.) 



5.2.4 Surface gravity 

In Sec. 5.2.2, a quantity n + = |/'(r + ) was introduced during the construction of 
Kruskal coordinates adapted to the outer horizon. We shall name this quantity the 
surface gravity of the black hole, and henceforth denote it simply by k. As we shall 
see in Sec. 5.5, the surface gravity provides an important characterization of black 
holes, and it plays a key role in the laws of black-hole mechanics. For the RN black 
hole, it is given explicitly by 



2r+ r+ 

where we have used Eq. (5.2.5). Notice that k = for an extreme RN black hole. 
On the other hand, 

K =1T7 (5.2.W) 
AM 

for a Schwarzschild black hole. 

The name "surface gravity" deserves a justification. Consider, in a static and 
spherically symmetric spacetime with metric 

ds 2 = -fdt 2 +f- 1 dr 2 +r 2 dn 2 , (5.2.17) 

a particle of unit mass held in place at a radius r. (Here, / is not necessarily 
restricted to have the RN form, although this will be the case of interest.) The 
four-velocity of the stationary particle is u a = / _1 / 2 i", and its acceleration is 
a" = u a .puP. The only nonvanishing component is a r = |/', and its magnitude is 

a(r) = (g a pa a aP) 1/2 = \ f-^ 2 f'(r). (5.2.18) 



This is the force required to hold the particle at r if the force is applied locally, at 
the particle's position. This, not surprisingly, diverges in the limit r — > r+. But 
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suppose instead that the particle is held in place by an observer at infinity, by 
means of an infinitely long, massless string. What is a^r), the force applied by 
this observer? 

To answer this we consider the following thought experiment. Let the observer 
at infinity raise the string by a small proper distance 6s, thereby doing an amount 
dWoo = (loads of work. At the particle's position, the displacement is also 6s, but 
the work done is SW = ads. (You may justify this statement by working in a local 
Lorentz frame at r.) Suppose now that the work SW is converted into radiation that 
is then collected at infinity. The received energy is redshifted by a factor / 1//2 , so 
that SEqq = f-^aSs. But energy conservation demands that the energy extracted 
be equal to the energy put in, so that SE^ = SW^. This implies 

a 00 (r)=f 1/2 a(r) = ±f(r). (5.2.19) 

This is the force applied by the observer at infinity. This quantity is well behaved 
in the limit r — > r + , and it is appropriate to call a 00 ('*+) the surface gravity of the 
black hole. Thus, 

K = a 00 (r+) = i /'(r + ). (5.2.20) 

The surface gravity is therefore the force required of an observer at infinity to hold 
a particle (of unit mass) in place at the event horizon. 

The surface gravity can also be defined in terms of the Killing vector t a . We have 
seen in Sec. 5.1.9 that the event horizon of a static spacetime is also a Killing horizon, 
so that t a is tangent to the horizon's null generators. Because t a is orthogonal to 
itself on the horizon, it is also normal to the horizon. But 3> = —t a t a = on 
the horizon, and since the normal vector is proportional to 3> a , there must exist a 
function k such that 

H%) ia = 2Kta ( 5 - 2 - 21 ) 
on the horizon. A brief calculation confirms that this k is the surface gravity: Using 
the coordinates (v, r), we have that t a d a = d v and t a dx a = dr on the horizon; with 
$ = —g vv = / we obtain $ jC( = f'd a r, which is just Eq. (5.2.21) with n = 
This calculation reveals also that the horizon's null generators are parameterized 
by v, so that t a = dx a /dv. 

Because t a is tangent to the horizon's null generators, it must satisfy the geodesic 
equation at r = r + . This comes as an immediate consequence of Eq. (5.2.21) and 
Killing's equation: On the horizon, 

t a . p lP = Kt a , (5.2.22) 

and we see that v is not an affine parameter on the generators. An affine parameter 
A can be obtained by integrating the equation dX/dv = e KV (Sec. 1.3). This gives 
A = V/k, where V = e KV is one of the Kruskal coordinates adapted to the event 
horizon — it was denoted V + in Sec. 5.2.2. It follows that on the horizon, the null 
vector 

k a = y-l t a (5.2.23) 

satisfies the geodesic equation in affine-parameter form. 

It is also possible to obtain an explicit formula for k. Because the congru- 
ence of null generators is necessarily hypersurface orthogonal, Frobenius' theorem 
(Sec. 2.4.3) guarantees that the relation 

*[a;/3*7] = 

holds on the event horizon. Using Killing's equation, this implies 

ta;pt~/ + t Tl atp + t/3- n t a = 0, 
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and contracting with t a 'P yields 
We have obtained 



^ = - l -t^t a , p , (5.2.24) 

in which it is understood that the right-hand side is evaluated at r = r + . Equations 
(5.2.21), (5.2.22), and (5.2.24) can all be regarded as fundamental definitions of the 
surface gravity; these are of course all equivalent. 



5.3 Kerr black hole 

5.3.1 The Kerr metric 

A solution to the Einstein field equations describing a rotating black hole was dis- 
covered by Roy Kerr in 1963. (There is also a solution to the Einstein-Maxwell 
equations that describes a charged, rotating black hole. It is known as the Kerr- 
Newman solution, and it is described in Sec. 5.7, Problem 8.) As we shall see, 
the Kerr metric can be written in a number of different ways. In the standard 
Boyer-Lindquist coordinates, it is given by 

, 2 / 2Mr\ , 2 4Marsin 2 <9 , ,, S . 2 „ , , 2 p 2 , , , 

ds 2 = - 1 5- dt 2 = dtdcb + sin 2 Odd? + '-- dr 2 + p 2 d6 2 

\ p J p p A " 

(5.3.1) 



where 



■ 4^ dt 2 + ^ sin 2 e(d<f>-udt) 2 + ^- dr 2 + p 2 d6 2 , 



p 2 = r 2 + a 2 cos 2 6, A = r 2 - 2Mr + a 2 



S = (r + a ) - a A sin 8, lu = = — - — . 



(5.3.2) 



The Kerr metric is stationary and axially symmetric; it therefore admits the Killing 
vectors t a = dx a /dt and 4> a = dx a /84>. It is also asymptotically flat. The Komar 
formulae (Sec. 4.3) confirm that M is the spacetime's ADM mass, and show that 
J = aM is the angular momentum (so that a is the ratio of angular momentum to 
mass). 

The components of the inverse metric are 



n tt_ S m_ 2M " r n **_ A-a 2 sin 2 fl 

9 ' p 2 A' 9 ~ p 2 A ' 9 p 2 Asm 2 6 . 



(5.3.3) 



grr = A ,/><> = _L 

/9 Z /9 Z 



The metric and its inverse have singularities at A = and p 2 = 0. To distinguish 
between coordinate and curvature singularities, we examine the squared Riemann 
tensor of the Kerr spacetime: 

R «W Rafji5 = 48M 2 (r 2 - a 2 cos 2 y - 16a 2 r 2 cos 2 g) (g;u) 
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This reveals that the singularity of the metric at A = is just a coordinate singu- 
larity, but that the Kerr spacetime is truly singular at p 2 = 0. The nature of the 
curvature singularity will be clarified in Sec. 5.3.8. 

Various properties of the Kerr spacetime will be examined in the following sub- 
sections. To facilitate this discussion we will introduce three families of observers: 
zero-angular- momentum observers (ZAMOs), static observers, and stationary ob- 
servers. 

5.3.2 Dragging of inertial frames: ZAMOs 

ZAMOs are freely moving observers with zero angular momentum: if u a is the 
four-velocity, then L = u a <jf = 0. This implies g^t + g<t><t>4> = 0, where an overdot 
indicates differentiation with respect to proper time r. Using Eqs. (5.3.1), this 
translates to 

0=^=w, (5.3.5) 

and we see that ZAMOs possess an angular velocity equal to w = —gt<t>/9<t><t>- This 
angular velocity increases as the observer approaches the black hole, and it goes in 
the same direction as the hole's own rotation — the ZAMOs rotate with the black 
hole. This striking property of the Kerr black hole, which in fact is shared by all 
rotating bodies, is called the dragging of inertial frames (see Sec. 3.10). At large 
distances from the black hole, uj ~ 2J/r 3 , and the dragging disappears completely 
at infinity. 

5.3.3 Static limit: static observers 

We now consider static observers in the Kerr spacetime. Such observers have a 
four- velocity proportional to the Killing vector t a : 

u a = -yt a , (5.3.6) 

where the factor 7 = (—g a /3t a t l3 )~ 1 ^ 2 ensures that the four- velocity is properly 
normalized. Because these observers must be held in place by an external agent (a 
rocket engine, for example), the motion is not geodesic. 

Static observers cannot exist everywhere in the Kerr spacetime. This can be 
seen from the fact that t a is not everywhere timelike, but becomes null when 7~ 2 = 
— g tt = 0; when this occurs, Eq. (5.3.6) breaks down. The static limit is therefore 
described by g tt = or, after using Eqs. (5.3.1) and (5.3.2), r 2 — 2Mr+a 2 cos 2 6 = 0. 
Solving for r reveals that the static limit is located at r = r s \ (6) , where 

r si (6) = M + y/M 2 - a 2 cos 2 6. (5.3.7) 

Thus, observers cannot remain static when r < r s i(#), even if an arbitrarily large 
force is applied. Instead, the dragging of inertial frames compels them to rotate 
with the black hole. As we shall see, the static limit does not coincide with the 
hole's event horizon. The finite region between the horizon and the static limit is 
called the ergosphere of the Kerr spacetime (Fig. 5.12). 

5.3.4 Event horizon: stationary observers 

We now consider observers moving in the <f> direction with an arbitrary, but uniform, 
angular velocity d<f>/dt = fi. Because such observers do not perceive any time 
variation in the black hole's gravitational field, they are called stationary observers. 
They move with a four-velocity 



u a = 1 (t a + n<p a ), 



(5.3.8) 
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where t a + £l<j) a is a Killing vector for the Kerr spacetime, and 7 a new normalization 
factor given by 

= -g<f><f>(tt 2 -2utt+g tt /g <f><)> ), 

where w = -g t </,/gtt- 

Stationary observers cannot exist everywhere in the Kerr spacetime: the vector 
t a + Q,(j) a must be timelike, and this fails to be true when 7~ 2 is nonpositive. It is 
easy to check that the condition 7~ 2 > gives rise to the following requirement on 
the angular velocity: 

< Q < (5.3.9) 

where fi± = w± y^w 2 - gu/g<p<p- After some algebra, using Eqs. (5.3.1) and (5.3.2), 
this reduces to 

A 1 / 2 -? 2 

n±=w±-— (5.3.10 
X sin 

A stationary observer with f2 = is a static observer, and we already know that 
static observers exist only outside the static limit. It must therefore be true that 
fi_ changes sign at r = r s \(6). This is confirmed by a few lines of algebra, using 
Eqs. (5.3.7) and (5.3.10). As r decreases further from r s i(#), fi_ increases while fi + 
decreases. Eventually we arrive at the situation fi_ = fi + , which implies fi = ui; at 
this point the stationary observer is forced to move around the black hole with an 
angular velocity equal to ui. This occurs when A = 0, or r 2 — 2Mr + a 2 = 0, and 
the largest solution is r = r + , where 

r+ = M + y/M 2 -a 2 . (5.3.11) 

Notice that the roots of A = are real if and only if a < M, or J < M 2 : there 
is an upper limit on the angular momentum of a black hole. Kerr black holes with 
a = M are said to be extremal. For a > M, the Kerr metric describes a naked 
singularity. 

The vector t a + Q,<jf becomes null at r = r + , and stationary observers can- 
not exist inside this surface, which we identify with the black hole's event horizon 
(Fig. 5.12). The quantity 

^ = w ( r +) = ^T^ (5 - 3 - 12) 
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is then interpreted as the angular velocity of the black hole. Stationary observers 
just outside the horizon have an angular velocity equal to fi^ — they are in a state 
of corotation with the black hole. 

To confirm that r = r + is truly the event horizon, we use the property that in a 
stationary spacetime, the event horizon is also an apparent horizon — a surface of 
zero expansion for a congruence of outgoing null geodesies orthogonal to the surface. 
The event horizon must therefore be a null, stationary surface. Now, the normal to 
any stationary surface must be proportional to d a r, and such a surface will be null 
if g a P(d a r)(dpr) = g rr = 0. Using Eq. (5.3.3), this implies 

A = r 2 - 2Mr + a 2 = 0. (5.3.13) 

The largest solution, r = r + , designates the event horizon. The other root, 

r_ = M - VM 2 -a 2 , (5.3.14) 

describes the black hole's inner apparent horizon, which is analogous to the inner 
horizon of the Reissner-Nordstrom black hole. 
We have found that the vector 

r = t a + n H c{> a (5.3.15) 

is null at the event horizon. It is tangent to the horizon's null generators, which wrap 
around the horizon with an angular velocity . Because it is a linear combination 
of two Killing vectors, £ a is also a Killing vector, and the event horizon of the Kerr 
spacetime is a Killing horizon. Notice an important difference between stationary 
and static black holes: For a static black hole, t a becomes null at the event horizon; 
for a stationary black hole, t a is null at the static limit, and becomes null at the 
event horizon. 

5.3.5 The Penrose process 

The fact that t a is spacelike in the ergosphere — the region r + < r < r s \(6) — 
implies that the (conserved) energy E = —p a t a of a particle with four-momentum 
p a can be of either sign. Particles with negative energy can therefore exist in the 
ergosphere, but they would never be able to escape from this region. (Note that 
E < refers to the energy that would be measured at infinity if the particle could 
be brought there. Any local measurement of the particle's energy inside the static 
limit would return a positive value.) 

It is easy to elaborate a scenario in which negative-energy particles created in 
the ergosphere are used to extract positive energy from a Kerr black hole. Imagine 
that a particle of energy E\ > comes from infinity and enters the ergosphere. 
There, it decays into two new particles, one with energy — E 2 < 0, the other with 
energy E 3 = E\ + Ei > E\. While the negative-energy particle remains within 
the static limit, the positive-energy particle escapes to infinity, where its energy is 
extracted. Because E 3 is larger than the energy of the initial particle, the black 
hole must have given off some of its own energy. This is the Penrose process, by 
which some of the energy of a rotating black hole can be extracted. 

The Penrose process is self-limiting: only a fraction of the hole's total energy 
can be tapped. Suppose that in order to exploit the Penrose process, a rotating 
black hole is made to absorb a particle of energy E = —p a t a < and angular 
momentum L = p a <t> a - Because the Killing vector £ a = t a + VlH<j> a is timelike just 
outside the event horizon, the combination E — VIhL must be positive; otherwise 
the particle would not be able to penetrate the horizon. Thus L < E/Hh, and L 
must be negative if E < 0. The black hole will therefore lose angular momentum 
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during the Penrose process. Eventually the hole's angular momentum will go to 
zero, the ergosphere will disappear, and the Penrose process will stop. We might 
say that only the hole's rotational energy can be extracted by the Penrose process. 

Note that in a process by which a black hole absorbs a particle of energy E (of 
either sign) and angular momentum L, its parameters change by amounts 6M = E 
and 8 J = L. Since E — VIhL must be positive, we have 

5M-tt H 5J > 0. 

As we shall see, this inequality is a direct consequence of the first and second laws 
of black-hole mechanics. 

5.3.6 Principal null congruences 

The Boyer-Lindquist coordinates, like the Schwarzschild coordinates, are singular 
at the event horizon: While a trip down to the event horizon requires a finite proper 
time, the interval of coordinate time t is infinite. Moreover, because the angular 
velocity d(f>/dt stays finite at the horizon, <j> also increases by an infinite amount. 
We therefore need another coordinate system to extend the Kerr metric beyond the 
event horizon. It is advantageous to tailor these new coordinates to the behaviour 
of null geodesies. The two congruences considered here (which are known as the 
principal null congruences of the Kerr spacetime) are especially simple to deal with; 
we will use them to construct new coordinates for the Kerr metric. 

It is a remarkable feature of the Kerr metric that the equations for geodesic 
motion can be expressed in a decoupled, first-order form. These equations involve 
three constants of the motion: the energy parameter E, the angular-momentum 
parameter L, and the "Carter constant" J2. (This last constant appears because of 
the existence of a Killing tensor. This is explained in Sec. 5.7, Problem 4, which also 
provides a derivation of the geodesic equations.) For null geodesies, the equations 
are 

p 2 i = -a(aE sin 2 6 -L) + (r 2 +a 2 )P/A, 
p 2 r = ±VR, 

P 2 e = ±Ve, 

p 2 4> = -(aE - L/ sin 2 6) + oP/A, 

in which an overdot indicates differentiation with respect to the affine parameter A, 
and 



P = E(r 2 +a 2 )-aL, 



(L-aEf +3 



R = P 2 - A 

6 = ^ + cos 2 6i(o 2 J B 2 -L 2 / sin 2 61). 

We simplify these equations by making the following choices: 

L = aE sin 2 6, £ = -(L - aE) 2 = -(aE cos 2 6) 2 . 

It is easy to check that these imply © = 0, so that our geodesies move with a 
constant value of 6. We also have P = E p 2 and R= (E p 2 ) 2 , which gives 

i = E(r 2 +a 2 )/A, r = ±E, = 0, <j) = aE/A. 

The constant E can be absorbed into the affine parameter A. We obtain an ingoing 
congruence by choosing the negative sign for f, and we shall use l a to denote its 
tangent vector field: 

l a d a = r —^-d t -d T + ^d <p . (5.3.16) 
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Choosing instead the positive sign gives an outgoing congruence, with 



k a d a = — - — d t + d r + — 



(5.3.17) 



as its tangent vector field. 

To give the simplest description of the ingoing congruence, we introduce new 
coordinates v and tp defined by 



v = t + r* , tp = 4> + r', 



(5.3.18) 



where 



and 



= r + 



fr 2 +a 2 J 
J— dr 
Mr+ 
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- 1 



Vm 2 
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r 



- 1 



• dr 



In 



r — r + 



(5.3.19) 



(5.3.20) 



A "' 2y/M 2 - a 2 

It is easy to check that in these coordinates, l r = —1 is the only nonvanishing 
component of the tangent vector. This means that v and tp (as well as 9) are 
constant on each of the ingoing null geodesies, and that — r is the affine parameter. 

The simplest description of the outgoing congruence is provided by the coordi- 
nates (u,r,6,x), where 

u = t-r*, x = 4>-r i . (5.3.21) 

In these coordinates, k r = +1 is the only nonvanishing component of the tangent 
vector. This shows that u and x ( as wen as are constant along the outgoing null 
geodesies, and that r is the affine parameter. 

The Kerr metric can be expressed in either one of these new coordinate systems. 
While the coordinates (v, r, 6, tp) are well behaved on the future horizon but singular 
on the past horizon, the coordinates (it, r, 6, x) are well behaved on the past horizon 
but singular on the future horizon. For example, a straightforward computation 
reveals that after a transformation to the ingoing coordinates, the Kerr metric 
becomes 



= - 1- 



2Mr\ 2 2 

— 2~ ] dv +2 dvdr — 2a sin 9 drdift 



4Mar sin 2 



dvdip + 



S 

"^2 



sin 2 6 dtp 2 +p 2 d6 2 . 



(5.3.22) 



These coordinates produce an extension of the Kerr metric across the future horizon. 
Several coordinate patches, both ingoing and outgoing, are required to cover the 
entire Kerr spacetime, whose causal structure is very similar to that of the Reissner- 
Nordstrom spacetime. We shall return to this point in Sec. 5.3.9. 



5. 3. 7 Kerr-Schild coordinates 

Another useful set of coordinates for the Kerr metric is (t',x,y,z), the pseudo- 
Lorentzian Kerr-Schild coordinates in terms of which the metric takes a particularly 
interesting form. These are constructed as follows. 

We start with Eq. (5.3.22) and separate out the terms that are proportional to 
M. After some algebra, we obtain 

ds 2 = -dv 2 + 2 dvdr- 2a sin 2 6 drdtp + (r 2 + a 2 ) sin 2 6dtp 2 +p 2 d6 2 
2Mr 

H — (dv — asm 2 6 dtp) 2 . 
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The terms that do not involve M have a simple interpretation: they give the metric 
of flat spacetime in a rather strange coordinate system. The rest of the line element 
can be written neatly in terms of l a : Recalling that V = — 1 is the only nonvanishing 
component of l a , we find that 

— l a dx a = dv — a sin 2 6 dip, 

and the line element becomes 

2 Mr o 
ds 2 = (ds 2 )^ + (l a dx a )\ (5.3.23) 

The Kerr metric can therefore be expressed as 

2Mr 

9 a p = Vap + —5- Up, (5.3.24) 

where rj a p is the metric of flat spacetime in the coordinates (v,r,0,ip). 

Equation (5.3.24) gives us the Kerr metric in a rather attractive form. Any 
metric that can be written as g a p = rj a p + Hl a lp, where H is a scalar function and 
l a a null vector field, is known as a Kerr-Schild metric. It is by adopting such an 
expression that Kerr discovered his solution in 1963. (Some general aspects of the 
Kerr-Schild decomposition are worked out in Sec. 5.7, Problem 5.) 

The next order of business is to find the coordinate transformation that brings 
r) a p to the standard Minkowski form. The answer is 

x + iy = (r + ia) sinde 1 ^, z = rcos6, t' = v - r. (5.3.25) 

Going through the necessary algebra does indeed reveal that in these coordinates, 

(ds 2 ) flat = -dt' 2 + dx 2 + dy 2 + dz 2 . (5.3.26) 

It is easy to work out the components of l a in this coordinate system. Because the 
null geodesies move with constant values of v, 6, and tp, we have that x + iy = 
— smOe 1 ^, z = — cos#, and i' = 1, where we have used r = —1. Lowering the 
indices is a trivial matter (see Sec. 5.7, Problem 5), and expressing the right-hand 
sides in terms of the new coordinates gives 

, , r. ,1 rx + ay , ry — ax , z , . „ „„. 

-l a dx a = dt' + -5 ^-dx+-^ -dy + -dz. 5.3.27 

The quantity r must now be expressed in terms of x, y, and z. Starting with 
x 2 +y 2 = (r 2 + a 2 ) sin 2 0, it is easy to show that 

r 4 - (x 2 +y 2 + z 2 - a 2 )r 2 - a 2 z 2 = 0, (5.3.28) 

which may be solved for r(x,y,z). Equations (5.3.23), (5.3.26)-(5.3.28) give the 
explicit form of the Kerr metric in the Kerr-Schild coordinates. 

5. 3. 8 The nature of the singularity 
We have seen that the Kerr spacetime possesses a curvature singularity at 

p 2 =r 2 + a 2 cos 2 6 = 0. 

According to this equation, the singularity occurs only in the equatorial plane 
{6 = 7r/2), at r = 0. The Kerr-Schild coordinates can help us make sense of 
this statement. The relations x 2 + y 2 = (r 2 +a 2 ) sin 2 6 and z = rcosO indicate that 
the "point" r = corresponds in fact to the entire disk x 2 + y 2 < a 2 in the plane 
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z = 0. The points interior to the disk correspond to angles such that sin 2 6 < 1. 
The boundary, 

2,2 2 

corresponds to the equatorial plane, and this is where the Kerr metric is singular. 
The curvature singularity of the Kerr spacetime is therefore shaped like a ring. This 
singularity can be avoided: Observers at r = can stay away from the equatorial 
plane, and they never have to encounter the singularity; such observers end up going 
through the ring. 



5. 3. 9 Maximal extension of the Kerr spacetime 

We have already constructed coordinate systems that allow the continuation of the 
Kerr metric across the event horizon. We now complete the discussion and show 
how the spacetime can also be extended beyond the inner horizon. For simplicity, 
we shall work with the section of the Kerr spacetime obtained by setting 6 = 0. 
This is the rotation axis, and because the Kerr metric is not spherically symmetric, 
this does represent a loss of generality. 

Going back to Eq. (5.3.1) and the original Boyer-Lindquist coordinates, we find 
that when 6 = 0, the Kerr metric reduces to 

i-J*^) & + + 

r 2 +a 2 , \ ( . r 2 + a 2 



r 2 + a 2 



r A + a 

dt - - — dr ) I dt + - — dr 



or 

ds 2 = -fdudv, (5.3.29) 
where u = t — r* and v = t + r* are the coordinates of Sec. 5.3.6. Here, 

f - A _ (r-r+)(r-r-) 

J = — 2 T~, — 2 ' (5.3.30) 



and r± = M ± y/M 2 — a 2 denote the positions of the outer and inner horizons, 
respectively. The metric of Eq. (5.3.29) is extremely simple, and the construction 
of Kruskal coordinates for the 6 = section of the Kerr spacetime proceeds just as 
for the Reissner-Nordstrom (RN) black hole (Sec. 5.2.2). 

We first consider the continuation of the metric across the event horizon. Near 
r = r + , Eq. (5.3.30) can be approximated by 

/ ~ 2«+(r - r+), 

where re + = |/'(r + ). It follows that 



r * = /y^2^ ln l K+(r - r+) l 



and / ~ ±2e 2K+r * = ±2e K +^ the upper sign refers to r > r + and the lower 
sign to r < r + . Introducing the new coordinates 

U+ = Te~ K+u , V+ = e K+v , (5.3.31) 

we find that near r = r + , the Kerr metric admits the manifestly regular form 
ds 2 ~ -2K+ 2 dU+dV+. 

Just as for the RN spacetime, the coordinates U+ and V+ are singular at the 
inner horizon, and another coordinate patch is required to extend the Kerr metric 
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u+ v+ u_ v_ 




Figure 5.13: Kruskal patches for the Kerr spacetime. 



beyond this horizon (Fig. 5.13). The procedure is now familiar. Near r = r_ we 
approximate Eq. (5.3.30) by / ~ — 2«_(r - r_), where k_ = ||/'(r_)|, so that 
/ ~ ^2e~ 2K ~ r ' = ^2e K -( u ~ v \ The appropriate coordinate transformation is now 

U = z Fe "-« ! v+ = -e~ K - v , (5.3.32) 

and the metric becomes ds 2 ~ — 2kT 2 dU_dV-. 

Just as for the RN spacetime, another copy of the outer horizon presents itself 
in the future of the inner horizon, and another Kruskal patch is required to ex- 
tend the spacetime beyond this new horizon. This continues ad nauseam, and we 
see that the maximally extended Kerr spacetime represents an infinite succession 
of asymptotically-flat universes connected by black-hole tunnels. There is more, 
however. It is easy to check that in a spacetime diagram based on the (C/_,V_) 
coordinates, the surface r = is represented by [7_V_ = —1. This is a timelike 
surface, and on the rotation axis, this surface is nonsingular. The Kerr spacetime 
can therefore be extended beyond r = 0, into a region in which r adopts negative 
values. This new region has no analogue in the RN spacetime; it contains no hori- 
zons, and it becomes flat in the limit r — ► — oo. Observers in this region interpret 
the Kerr metric as describing the gravitational field of a (naked) ring singularity. 
You should be able to convince yourself that this singularity has a negative mass. 

The maximally extended Kerr spacetime can be represented by a Penrose-Carter 
diagram (Fig. 5.14). The resulting causal structure is extremely complex. It should 
be kept in mind, however, that the interior of a Kerr black hole is subject to the 
same instability as that of a RN black hole (see Sec. 5.2.3 and Sec. 5.7, Problem 3). 
The tunnels to other universes, and the regions of negative r, are not present inside 
physically realistic black holes. 

5.3.10 Surface gravity 

As was pointed out in Sec. 5.3.4, the vector 

£ a =t a +n H <j> a , (5.3.33) 

where £Ih is given by Eq. (5.3.12), is null at the event horizon, and is in fact tangent 
to the horizon's null generators. From the same arguments as those presented in 
Sec. 5.2.4, the black hole's surface gravity k can be defined by 

H%) ia = 2k&, (5-3-34) 

or by 

= < a ' ( 5 - 3 - 35 ) 
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Figure 5.14: Penrose-Carter diagram of the Kerr spacetime. 



or finally, by 

« 2 = -^ aU W (5-3.36) 

These definitions are all equivalent. 

Let us use Eq. (5.3.34) to calculate the surface gravity. The norm of £ a is given 

by 

fBf S sin 2 fl , 2 p 2 A 

Z% = — (Off - w) 2 - 

and differentiation yields 

H%) ;a = ^A, a 

on the horizon, at which w = fi^ and A = 0. We have that A jC( = 2(r + — M) d a r 
and £ a = (1 — aQ,H sin 2 #) 9 a r on the horizon, and a few lines of algebra reveal that 
the surface gravity is 

«=^ = ^?- (5-3.37) 
r + z + gt + gt 

Notice that this is the same quantity that was denoted k + in Sec. 5.3.9. Notice also 
that k = for an extreme Kerr black hole. And finally, notice that in the general 
does not depend on 6 — the surface gravity is uniform on the event horizon. 
We shall return to this remarkable fact in Sec. 5.5.1. 

5.3.11 Bifurcation two-sphere 

In the coordinates (v, r, 6, ip) which are regular on the event horizon, £ a d a = d v + 
Vlnd^. This shows that the horizon's null generators are parameterized by the 
advanced-time coordinate v, but as Eq. (5.3.35) reveals, v is not affine. An affine 
parameter A is obtained by integrating 

_ e K V 

dv 

so that k\ = e KV = V. It follows that on the horizon, the vector k a = V _1 £ a 
satisfies the geodesic equation in affine-parameter form: = 0. [This vector is 
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not equal to the k a introduced in Sec. 5.3.6. It is easy to check that these vectors 
are related by fc" ew = |Afc" ld / (r 2 +a 2 ), where the right-hand side is to be evaluated 
on the horizon.] If n ^ and the event horizon is geodesically complete (in the 
sense that the null generators can be extended arbitrarily far into the past), the 
relation 

C = Vk a (5.3.38) 

implies that £ a = at V = 0. This defines a closed two-surface called the bifurcation 
two-sphere of the Kerr spacetime. The conditions are sometimes violated: The event 
horizon of a black hole formed by gravitational collapse is not geodesically complete, 
because the horizon was necessarily formed in the finite past; and as we have seen, 
the surface gravity of an extreme Kerr black hole (for which M = a) vanishes. In 
either one of these situations, the bifurcation two-sphere does not exist. 

5.3.12 Smarr's formula 

There exists a simple algebraic relation between the black-hole mass M, its angular 
momentum J = Ma, and its surface area A. This is defined by 

A = I v^d 2 <9, (5.3.39) 

where J%? is a two-dimensional cross section of the event horizon, described by 
v = constant, r = r + , < < n, and < ip < 2n. From Eq. (5.3.22) we find that 
the induced metric is given by 

(jab d9 A de B = p 2 de 2 + 4 sin 2 6 dip 2 , 
/'- 

so that y/ad 2 = \/S sm9 d6dip = (r + 2 + a 2 ) sm6d6dip. Integration yields 

A = 47r(r+ 2 +a 2 ). (5.3.40) 

The algebraic relation, which was discovered by Larry Smarr in 1973, reads 

kA 

M = 2tt H J+— , (5.3.41) 

where £Ih is the hole's angular velocity and k its surface gravity. Smarr's formula 
is established by straightforward algebra: Substituting Eqs. (5.3.12), (5.3.37), and 
(5.3.40) into the right-hand side of Eq. (5.3.41) reveals that it is indeed equal to 
M. We will generalize Smarr's formula, and present an alternative derivation, in 
Sec. 5.5.2. 

5.3.13 Variation law 

It is clear that the surface area of a black hole is a function of its mass and angular 
momentum: A = A(M,J). Suppose that a black hole of mass M and angular 
momentum J is perturbed so that its parameters evolve to M + 5M and J + 5 J. 
(For example, the black hole might absorb a particle, as was considered in Sec. 5.3.5.) 
How does the area change? There exists a simple formula relating 6 A to the changes 
in mass and angular momentum. It is 

— 6A = 6M-tt H 6 J. (5.3.42) 

To derive this we start with Eq. (5.3.40), which immediately implies 



— = r + 6r + + a 6a. 
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But the horizon radius r + depends on M and a; the defining relation is r + 2 — 
2Mr + + a 2 = 0, and this gives us 

(r + - M) 5r + = r + 5M - a da. 

This result can be substituted into the preceding expression for 5A. The final step 
is to relate a to the hole's angular momentum J; we have that a = J/M, and this 
implies M 6a = 6 J — aSM. Combining these results, we arrive at Eq. (5.3.42) after 
involving Eqs. (5.3.12) and (5.3.37). 

In Sec. 5.3.5 we found that the right-hand side of Eq. (5.3.42) must be positive. 
What we have, therefore, is the statement that the surface area of a Kerr black 
hole always increases during a process by which it absorbs a particle. This is a 
restricted version of the second law of black-hole mechanics, to which we shall 
return in Sec. 5.5.4. 

5.4 General properties of black holes 

The Kerr family of solutions to the Einstein field equations plays an extremely 
important role in the description of black holes, but this does not mean that all 
black holes are Kerr black holes. For example, a black hole accreting matter is not 
stationary, and a stationary hole is not a Kerr black hole if it is tidally distorted 
by nearby masses. In this section we consider those properties of black holes that 
are quite general, and not specific to any particular solution to the Einstein field 
equations. 

5.4-1 General black holes 

A spacetime containing a black hole possesses two distinct regions, the interior and 
exterior of the black hole; they are distinguished by the property that all exter- 
nal observers are causally disconnected from events occurring inside. Physically 
speaking, this corresponds to the fact that once she has entered the black hole, an 
observer can no longer send signals to the outside world. 

These fundamental notions can be cast in mathematical terms. Consider an 
event p and the set of all events that can be reached from p by future-directed 
curves, either timelike or null (Fig. 5.15). This set is denoted J + (p), and is called 
the causal future of p. A similar definition can be given for its causal past, J~(p)- 
These definitions can be extended to whole sets of events: If S is such a set, then 
J + {S) is the union of the causal futures of all the events p contained in 5; a similar 
definition can be given for J~ (S) . 

Loosely speaking, a spacetime contains a black hole if there exist null geodesies 
that never reach future null infinity, denoted These originate from the black- 




Figure 5.15: Causal future and past of an event p. 
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hole interior, a region characterized by the very fact that all future-directed curves 
starting from it fail to reach Thus, events lying within the black-hole interior 
cannot be in the causal past of The black-hole region B of the spacetime 

manifold is therefore the set of all events p that do not belong to the causal past 
of future null infinity: 

B = Jt-J-(S + ). (5.4.1) 
The event horizon H is then defined to be the boundary of the black-hole region: 

H = dB = d(j-(J r+ )). (5.4.2) 

The two-dimensional surface obtained by intersecting the event horizon with a 
spacelike hypersurface S is denoted Jff; it is called a cross section of the horizon. 

Because the event horizon is a causal boundary, it must be a null hypersurface. 
Penrose (1968) was able to establish that the event horizon is a null hypersurface 
generated by null geodesies that have no future end points. This means that: (i) 
when followed into the past, a generator may, but does not have to, leave the 
horizon; (ii) once a generator has entered the horizon, it cannot leave; (iii) two 
generators can never intersect, except possibly when they both enter the horizon; 
and finally, (iv) through every point on the event horizon, except for those at which 
new generators enter, there passes one and only one generator. It should be clear 
that the entry points into the event horizon are caustics of the congruence of null 
generators (Fig. 5.16). 

The black-hole region typically contains trapped surfaces, closed two-surfaces S 
with the property that for both ingoing and outgoing congruences of null geodesies 
orthogonal to S, the expansion is negative everywhere on S. (Exceptions are the 
extreme cases of Kerr, Kerr-Newman, or Reissner-Nordstrom black holes, which do 
not contain any trapped surfaces.) The three-dimensional boundary of the region 
of spacetime that contains trapped surfaces — the trapped region — is the trapping 
horizon, and its two-dimensional intersection with a spacelike hypersurface X is 
called an apparent horizon. The apparent horizon is therefore a marginally trapped 
surface — a closed two-surface on which one of the congruences has a zero expansion. 
The apparent horizon of a stationary black hole typically coincides with the event 
horizon. In dynamical situations, however, the apparent horizon always lies within 
the black-hole region (Fig. 5.16), unless the null energy condition is violated. (Refer 
back to Sec. 5.1.8 for a specific example.) 




Figure 5.16: Event and apparent horizons of a black-hole spacetime. 
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The presence of trapped surfaces inside a black hole unequivocally announces the 
formation of a singularity; this is the content of the beautiful singularity theorems of 
Penrose (1965) and Hawking and Penrose (1970). The theorems rely on some form 
of energy condition (null for Penrose's original formulation, strong for the Hawking- 
Penrose extension) and require additional technical assumptions. The nature of the 
"singularity" predicted by the theorems is rather vague: The singularity is revealed 
by the presence inside the black hole of at least one incomplete timelike or null 
geodesic, but the physical reason for incompleteness is not identified. In all known 
examples satisfying the conditions of the theorems, however, the black hole contains 
a curvature singularity at which the Riemann tensor diverges. 

5.4-2 Stationary black holes 

It was established by Hawking in 1972 that if a black hole is stationary, then it 
must be either static or axially symmetric. This means that the spacetime of a 
(stationary) rotating hole is necessarily axially symmetric, and that it must ad- 
mit two Killing vectors, t a and <f> a . Hawking was also able to show that a linear 
combination of these vectors, 

Z a =F+Sl B <lt a , (5.4.3) 

is null at the event horizon. Here, £Ih is the hole's angular velocity, which vanishes 
if the spacetime is nonrotating (and therefore static). Thus, the event horizon is a 
Killing horizon, and £ a is tangent to the horizon's null generators: £ a = dx a /dv, 
with v denoting the (non-affine) parameter on the geodesies. The hole's surface 
gravity k is then defined by the relation 

= < a ' ( 5 - 4 - 4 ) 

which holds on the horizon. We will prove in Sec. 5.5.2 that k is constant along 
the horizon's null generators. (Indeed, it is uniform over the entire horizon.) This 
means that we can replace v by an affine parameter A = V/n, where V = e KV 
(Sec. 5.3.10). Then 

k a = V- 1 ^ (5.4.5) 

satisfies the geodesic equation in affine-parameter form. It follows that if k ^ and 
the horizon is geodesically complete (in the sense that its generators never leave 
the horizon when followed into the past), then there exists a two-surface, called the 
bifurcation two-sphere, on which £ a = 0. 

The properties of stationary black holes listed here were all encountered before, 
during our description of the Kerr solution. It should be appreciated, however, that 
Eqs. (5.4.3)-(5.4.5) hold by virtue of the sole fact that the black hole is stationary; 
these results do not depend on the specific details of a particular metric. 

The observation that a stationary black hole must be axially symmetric if it is 
rotating seems puzzling. After all, it should be possible to place a nonsymmetrical 
distribution of matter outside the hole, and let it tidally distort the event horizon 
in a nonsymmetrical manner. This, presumably, would produce a black hole that 
is still stationary and rotating, but not axially symmetric. Hawking and Hartle 
(1972) have shown that this, in fact, is false! The reason is that such a distribution 
of matter would impart a torque on the black hole, which would force it to spin 
down to a nonrotating (and static) configuration. Thus, such a situation would not 
leave the black hole stationary. 

Additional properties of stationary black holes can be inferred from Raychaud- 
huri's equation, 

jo -i 

^ = -- 6 2 - a^a af} - R af) k a k^ (5.4.6) 
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in which we have put u> a p = to reflect the fact that the congruence of null genera- 
tors is necessarily hypersurface orthogonal. The event horizon will be stationary if 
9 and d9/d\ are both zero. Using the Einstein field equations and the null energy 
condition, Eq. (5.4.6) implies that the stress-energy tensor must satisfy 

T a pZ*e = (5.4.7) 

on the horizon. This means that matter cannot be flowing across the event hori- 
zon; if it were, the generators would get focused and the black hole would not be 
stationary. Raychaudhuri's equation also implies 

o afi = 0; (5.4.8) 

the null generators of the event horizon have a vanishing shear tensor. 



5.4-3 Stationary black holes in vacuum 

In the absence of any matter in their exterior, stationary black holes admit an 
extremely simple description. 

If the black hole is static, then it must be spherically symmetric, and it can only 
be described by the Schwarzschild solution. This beautiful uniqueness theorem, 
the first of its kind, was established by Werner Israel in 1967. It implies that in 
the absence of angular momentum, complete gravitational collapse must result in 
a Schwarzschild black hole. This seems puzzling, because the statement is true 
irrespective of the initial shape of the progenitor, which might have been strongly 
nonspherical. The mechanism by which a nonspherical star shakes off its higher 
multipole moments during gravitational collapse was elucidated by Richard Price 
in 1972: These multipole moments are simply radiated away, either out to infinity 
or into the black hole. After the radiation has faded away, the hole settles down to 
its final, spherical state. 

If the black hole is axially symmetric, then it must be a Kerr black hole. This 
extension of Israel's uniqueness theorem was established by Brandon Carter (1971) 
and D.C. Robinson (1975). 

The black-hole uniqueness theorems can be generalized to include situations in 
which the black hole carries an electric charge. If the black hole is static, then it 
must be a Reissner-Nordstrom black hole (Israel, 1968). If it is axially symmetric, 
then it must be a Kerr-Newman black hole (Mazur, 1982; Bunting, unpublished). 

We see that a black hole in isolation can be described, uniquely and com- 
pletely, by just three parameters: its mass, angular momentum, and charge. No 
other parameter is required, and this remarkable property is at the origin of John 
A. Wheeler's famous phrase, "a black hole has no hair". Chandrasekhar (1987) was 
well justified to write: 

Black holes are macroscopic objects with masses varying from a few solar 
masses to millions of solar masses. To the extent that they may be consid- 
ered as stationary and isolated, to that extent, they are all, every single one 
of them, described exactly by the Kerr solution. This is the only instance 
we have of an exact description of a macroscopic object. Macroscopic ob- 
jects, as we see them all around us, are governed by a variety of forces, 
derived from a variety of approximations to a variety of physical theories. 
In contrast, the only elements in the construction of black holes are our 
basic concepts of space and time. They are, thus, almost by definition, the 
most perfect macroscopic objects there are in the universe. And since the 
general theory of relativity provides a single unique two-parameter family 
of solutions for their descriptions, they are the simplest objects as well. 
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5.5 The laws of black- hole mechanics 

In 1973, Jim Bardeen, Brandon Carter, and Stephen Hawking formulated a set of 
four laws governing the behaviour of black holes. These laws of black-hole mechan- 
ics bear a striking resemblance to the four laws of thermodynamics. While this 
analogy was at first perceived to be purely formal and coincidental, it soon became 
clear that black holes do indeed behave as thermodynamic systems. The crucial 
step in this realization was Hawking's remarkable discovery of 1974 that quantum 
processes allow a black hole to emit a thermal flux of particles. It is thus possible 
for a black hole to be in thermal equilibrium with other thermodynamic systems. 
The laws of black-hole mechanics, therefore, are nothing but a description of the 
thermodynamics of black holes. 



5. 5. 1 Preliminaries 

We begin our discussion of the four laws by collecting a few important results from 
preceding chapters; these will form the bulk of the mathematical framework required 
for the derivations. 

Let y a = (v,6 A ) be coordinates on the event horizon. The advanced-time coor- 
dinate v is a non-affine parameter on the horizon's null generators, and 6 A spans 
the two-dimensional space transverse to the generators. The vectors 

are tangent to the horizon; they satisfy £«e5 = = £^e% and = t a + fiif^ a is a 
Killing vector. We complete the basis by introducing an auxiliary null vector TV", 
normalized by N a ^ a = —1. This basis gives us the completeness relations (Sec. 3.1) 

where a AB is the inverse of gab = 9ai3^A e B- The determinant of the transverse 
two-metric will be denoted a. 

The vectorial surface element on the event horizon can be expressed as (Sec. 3.2) 

dY, a = -£ a dSdv, (5.5.2) 

where dS = \fa d 2 9. The two-dimensional surface element on a cross section v = 
constant is 

dS afi = 2£ [a JVa dS. (5.5.3) 

We shall denote such a cross section by JF. 

Finally, we will need Raychaudhuri's equation for the congruence of null gener- 
ators, expressed in a form that does not require the parameter to be affine. This 
was worked out in Sec. 2.6, Problem 8, and the answer is 

^. = K e-±6 2 - o^o af) - 8nT a ^ a f; (5.5.4) 

the last term would normally involve the Ricci tensor, but we have used the Einstein 
field equations to write it in terms of the stress-energy tensor. We recall that 
is the fractional rate of change of the congruence's cross-sectional area: 6 = 
(dS)- 1 d(dS)/dv. 
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5.5.2 Zeroth law 

The zeroth law of black- hole mechanics states that the surface gravity of a stationary 
black hole is uniform over the entire event horizon. We saw in Sec. 5.3.10 that this 
statement is indeed true for the specific case of a Kerr black hole, but the scope of 
the zeroth law is much wider: the black hole need not be isolated, and its metric 
need not be the Kerr metric. 

To prove that k is uniform on the event horizon, we need to establish that (i) 
k is constant along the horizon's null generators, and (ii) k does not vary from 
generator to generator. We will prove both statements in turn, starting with 

K 2 = -^r ;/3 £a;/3 (5-5.5) 

as our definition for the surface gravity. (We saw in Sec. 5.2.4 that this relation is 
equivalent to £, a .^ = We shall need the identity 

£,a;ixv = Raixvf)^ i (5.5.6) 

which is satisfied by any Killing vector £ a . (This was derived in Sec. 1.13, Problem 
90 

We differentiate Eq. (5.5.5) in the directions tangent to the horizon. (Because 
k is defined only on the event horizon, its normal derivative does not exist.) Using 
Eq. (5.5.6), we obtain 

2KK a = -e^iW^- (5-5-7) 
The fact that k is constant along the generators follows immediately from this: 

K a £ a = 0. (5.5.8) 

We must now examine how k changes in the transverse directions. Equation (5.5.7) 
implies 

2nn a e°A = — Rpvafj e^£^, 

and we would like to show that the right-hand side is zero. Let us first assume 
that the event horizon is geodesically complete, so that it contains a bifurcation 
two-sphere, at which £ a = 0. Then the last equation implies that = at the 

bifurcation two-sphere. Because constant on the null generators (Sec. 5.7, 

Problem 6), we have that K a eJ = on every cross section v = constant of the 
event horizon. This shows that the value of k does not change from generator to 
generator, and we conclude that k is uniform over the entire event horizon. 

It is easy to see that the property n^e^ = must be independent of the existence 
of a bifurcation two-sphere. Consider two stationary black holes, identical in every 
respect in the future of v = (say), but different in the past, so that only one of them 
possesses a bifurcation two-sphere. (We imagine that the first black hole has existed 
forever, and that the second black hole was formed prior to v = by gravitational 
collapse; the second black hole is stationary only for v > 0.) Our proof that K ta e^ = 
on all cross sections v = constant of the event horizon applies to the first black 
hole. But since the spacetimes are identical for v > 0, the property K a eJ = 
must apply also to the second black hole. Thus, the zeroth law is established for all 
stationary black holes, whether or not they are geodesically complete. 

It is clear that the relation £ M; "i? M „a/3 e 2£^ = must hold everywhere on a sta- 
tionary event horizon, but it is surprisingly difficult to prove this. In their original 
discussion, Bardeen, Carter, and Hawking establish this identity by using the Ein- 
stein field equations and the dominant energy condition (Sec. 5.7, Problem 7). This 
restriction was lifted in a 1996 paper by Racz and Wald. 
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Figure 5.17: A spacelike hypersurface in a black-hole spacetime. 



5. 5. 3 Generalized Smarr formula 

Before moving on to the first law, we generalize Smarr's formula (Sec. 5.3.12) that 
relates the black-hole mass M to its angular momentum J, angular velocity , 
surface gravity ft, and surface area A. In the present context, the black hole is 
stationary and axially symmetric, but it is not assumed to be a Kerr black hole. 

Our starting point is the Komar expressions for total mass and angular momen- 
tum (Sec. 4.3.3): 

M = £v a t? dS aP , J = ^-jv a 4J 3 dS aP , 

where the integrations are over a closed two-surface at infinity. We consider a space- 
like hypersurface £ extending from the event horizon to spatial infinity (Fig. 5.17). 
Its inner boundary is , a two-dimensional cross section of the event horizon, and 
its outer boundary is S. Using Gauss' theorem, as was done in Sec. 4.3.3 (but 
without the inner boundary), we find that M and J can be expressed as 

M = M H + 2 j Jr af} - ^Tg a ^jn a ^Vhd 3 y (5.5.9) 



and 



3 = J " ~ L { Tap ~ \ T S^n a ^y/h^y, (5.5.10) 

where Mh and Jh are the black-hole mass and angular momentum, respectively. 
They are given by surface integrals over Jif: 

M H = ~f V a t?dS aP (5.5.11) 
tin J 



and 



Jh = t^(1 V a ^dS aP , (5.5.12) 



where dS a p is the surface element of Eq. (5.5.3). The interpretation of Eqs. (5.5.9) 
and (5.5.10) is clear: the total mass M (angular momentum J) is given by a con- 
tribution Mh (Jh) from the black hole, plus a contribution from the matter dis- 
tribution outside the hole. If the black hole is in vacuum, then M = Mh and 
J = Jh- 
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Smarr's formula emerges after a few simple steps. Using Eqs. (5.5.3), (5.5.11) 
and (5.5.12), we have 

M H - 2 n H j H = ~<f V a (tP + n H ^) dSa/3 



= -hi e^ a N P ds 



4tt jx, 



= dS, 

4-7T 



dS 



3V 



where we have used the relation £ a N a = — 1 and the fact that k is constant over 
JF. The last integration gives the horizon's surface area, and we arrive at 



kA 

M h = 2U h J h + — , (5.5.13) 

47T 



the generalized Smarr formula. 



5.5.4 First law 

We consider a quasi-static process during which a stationary black hole of mass M, 
angular momentum J, and surface area A is taken to a new stationary black hole 
with parameters M + 5M, J + 5J, and A + 5A. The first law of black-hole mechanics 
states that the changes in mass, angular momentum, and surface area are related 
by 

6M = ^-6A + tt H 6J. (5.5.14) 

If the initial and final black holes are in vacuum, then they are Kerr black holes 
by virtue of the uniqueness theorems, and a derivation of Eq. (5.5.14) was already 
presented in Sec. 5.3.13. That derivation, however, relied heavily on the details of 
the Kerr metric. We shall now present a derivation that is quite insensitive to those 
details. In particular, we shall not assume that the black hole is in vacuum. 

We suppose that a black hole, initially in a stationary state, is perturbed by a 
small quantity of matter described by the (infinitesimal) stress-energy tensor T a p. 
As a result, the mass and angular momentum of the black hole increase by amounts 
(Sec. 4.3.4) 

SM = - f T a gt^dT, a (5.5.15) 
Jh 

and 

SJ= f T a g4^dS a , (5.5.16) 
Jh 

where the integrations are over the entire event horizon. We will be working to 
first order in the perturbation T a p, keeping t a , <f> a , and dT, a at their unperturbed 
values. We assume that at the end of the process, the black hole is returned to 
another stationary state. 

Substituting the surface element of Eq. (5.5.2) into Eqs. (5.5.15) and (5.5.16), 
we find 

5M-n H SJ = [ T aP [f + H /)r dSdv 
Jh 

= fdv<f T a ^ a ^ dS. 
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To work out the integral, we turn to Raychaudhuri's equation, Eq. (5.5.4). Because 
9 and a a p are quantities of the first order in T a p, it is appropriate to neglect the 
quadratic terms, so that we have 

and 



6M - tt H 6J = - 




Because the black hole is stationary both before and after the perturbation, 6(v = 
±oo ) = 0, and the boundary terms vanish. Using the fact that 6 is the fractional 
rate of change of the congruence's cross-sectional area, we obtain 



6M - tt H 6J = 




where 5A is the change in the black hole's surface area. This is Eq. (5.5.14), the 
statement of the first law of black-hole mechanics. 

5.5.5 Second law 

The second law of black-hole mechanics states that if the null energy condition is 
satisfied, then the surface area of a black hole can never decrease: 5A > 0. This 
area theorem was established by Stephen Hawking in 1971. 

Glossing over various technical details, the area theorem follows directly from 
the focusing theorem (Sec. 2.4.5) and Penrose's observation that the event horizon 
is generated by null geodesies with no future end points. This statement means 
that the generators of the event horizon can never run into caustics. (A generator 
can enter the horizon at a caustic point, but once in H it will never again meet 
another caustic.) The focusing theorem then implies that 6, the expansion of the 
congruence of null generators, must be positive, or zero, everywhere on the event 
horizon. To see this, suppose that 6 < for some of the generators. The focusing 
theorem then guarantees that these generators will converge into a caustic, at which 
6 = —oo. We have a contradiction, and we must conclude that 6 > everywhere 
on the event horizon. This implies that the horizon's surface area will not decrease, 
which is just the statement of the area theorem. (The fact that new generators can 
enter the event horizon contributes even further to the growth of its area.) 

5. 5. 6 Third law 

The third law of black-hole mechanics states that if the stress-energy tensor is 
bounded and satisfies the weak energy condition, then the surface gravity of a black 
hole cannot be reduced to zero within a finite advanced time. A precise formulation 
of this law was given by Werner Israel in 1986. 

We have seen that a black hole of zero surface gravity is an extreme black hole. 
(Recall that a Kerr black hole is extremal if a = M; for a Reissner-Nordstrom black 
hole, the condition is \Q\ = M.) An equivalent statement of the third law is therefore 
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that under the stated conditions on the stress-energy tensor, it is impossible for a 
black hole to become extremal within a finite advanced time. 

The proof of the third law is rather involved, and we will not attempt to go 
through it here. Instead of presenting a proof, we will illustrate the fact that the 
third law is essentially a consequence of the weak energy condition. 

For the purpose of this discussion, we need a black-hole spacetime which is 
sufficiently dynamical that it has the potential of becoming extremal at a finite 
advanced time v. A simple choice is the charged generalization of the ingoing 
Vaidya spacetime, whose metric is given by 

ds 2 = -fdv 2 +2dvdr + r 2 dn 2 , (5.5.17) 

with 

/=l-^ + ^M. (5.5.18) 

This metric describes a black hole whose mass m and charge q change with time 
because of irradiation by charged null dust, a fictitious form of matter. This inter- 
pretation is confirmed by inspection of the stress-energy tensor, 

T ap =TZ P st +T^, (5.5.19) 

where 

i& -*<■!•. '-si? !("-£) <"- 20 > 

is the contribution from the null dust (l a = —d a v is a null vector), and 

T em ^=Pdiag(-l, -1,1,1), P =^a ( 5 - 5 - 21 ) 

is the contribution from the electromagnetic field. The spacetime of Eqs. (5.5.17) 
and (5.5.18) will produce a violation of the third law if m(vo) = q(vo) for some 
advanced time v < oo. 

An essential aspect of this discussion is the weak energy condition (Sec. 2.1), 
which states that the energy density measured by an observer with four-velocity 
u a = dx a /dr will always be positive: 

TnfjVpV? > 0. 

Here, T a @ is the stress-energy tensor of Eq. (5.5.19). If our observer is restricted to 
move in the radial direction only, then T a pu a u^ = p(dv/dr) 2 + P. Because dv/dr 
can be arbitrarily large, the weak energy condition requires p > 0. In particular, p 
must be positive at the apparent horizon, r = r+(i>), where r + = m + (m 2 — q 2 ) 1 ^ 2 ■ 
This gives us the following condition: 

4-Kr + 3 p(r + ) = mm - qq + (m 2 - q 2 ) 1/2 m > 0, (5.5.22) 

where an overdot indicates differentiation with respect to v. 

Let us imagine a situation in which the black hole becomes extremal at a finite 
advanced time vo- This means that A(vo) = 0, where A(v) = m(v) — q(v). Because 
the black hole was not extremal before v = vo, we have that A(i>) > for v < vo, 
and A(v) must be decreasing as v approaches vq. However, Eq. (5.5.22) implies 

m(v ) A(v ) > 0, 

according to which A(v) must be increasing. We have a contradiction, and we 
conclude that the weak energy condition prevents the black hole from ever becoming 
extremal at a finite advanced time. 
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5.5.7 Black-hole thermodynamics 



The four laws of black-hole mechanics bear a striking resemblance to the laws of 
thermodynamics, with n playing the role of temperature, A that of entropy, and 
M that of internal energy. Hawking's discovery that quantum processes give rise 
to a thermal flux of particles from black holes implies they do indeed behave as 
thermodynamic systems. Black holes have a well-defined temperature, which as a 
matter of fact is proportional to the hole's surface gravity: 



The zeroth law is therefore a special case of the corresponding law of thermodynam- 
ics, which states that a system in thermal equilibrium has a uniform temperature. 
The first law, when recognized as a special case of the corresponding law of ther- 
modynamics, implies that the black-hole entropy must be given by 




(5.5.24) 



The second law is therefore also a special case of the corresponding law of thermody- 
namics, which states that the entropy of an isolated system can never decrease. In 
this regard it should be noted that Hawking radiation actually causes the black-hole 
area to decrease, in violation of the area theorem. (The radiation's stress-energy 
tensor does not satisfy the null energy condition.) However, the process of black- 
hole evaporation does not violate the generalized second law, which states that the 
total entropy, the sum of radiation and black-hole entropies, does not decrease. 

The fact that black holes behave as thermodynamic systems reveals a deep 
connection between such disparate fields as gravitation, quantum mechanics, and 
thermodynamics. This connection is still poorly understood today. 



During the preparation of this chapter I have relied on the following references: 
Bardeen, Carter, and Hawking (1973); Carter (1979); Chandrasekhar (1983); Hay- 
ward (1994); Israel (1986a); Israel (1986b); Sullivan and Israel (1980); Misner, 
Thorne, and Wheeler (1973); Wald (1984); and Wald (1992). 
More specifically: 

The term "trapping horizon", used in Sees. 5.1.7 and 5.4.1, was introduced by 
Sean Hayward in his 1994 paper. The various definitions for the surface gravity 
(Sees. 5.2.4, 5.3.10, 5.4.2, and 5.5.2) are taken from Sec. 12.5 of Wald (1984). The 
presentation of the Kerr black hole is based on Sees. 33.1-5 of Misner, Thorne, 
and Wheeler, and Sees. 57 and 58 of Chandrasekhar. The definitions for black- 
hole region and event horizon are taken from Sec. 12.1 of Wald (1984); trapped 
surfaces and apparent horizons are defined in Wald's Sec. 9.5 and 12.2, respectively. 
Penrose's theorem on the structure of the event horizon (Sec. 5.4.1) is very nicely 
discussed in Sec. 34.4 of Misner, Thorne, and Wheeler. Section 9.5 of Wald (1984) 
provides a thorough discussion of the singularity theorems. The general properties 
of stationary black holes (Sec. 5.4.2) are discussed in Sec. 12.3 of Wald (1984) 
and Sec. 6.3.1 of Carter. An overview of the uniqueness theorems of black-hole 
spacetimes (Sec. 5.4.3) can be found in Sec. 12.3 of Wald (1984) and Sec. 6.7 of 
Carter. In Sec. 5.5, the proofs of the zeroth and first laws are taken from Wald's 
1992 Erice lectures. The generalized Smarr formula is derived in Sec. 6.6.1 of 
Carter. The discussion of the second law is adapted from Sec. 6.1.2 of Carter. The 
final form of the third law was given in Israel (1986b); my discussion is based on 




K. 



(5.5.23) 



5.6 Bibliographical notes 



5.7 Problems 



165 



Sullivan and Israel. Finally, in the problems below, the material on the Majumdar- 
Papapetrou solution is taken from Sec. 113 of Chandrasekhar, the description of 
null-dust collapse is adapted from Israel (1986a), and the alternative derivation of 
the zeroth law is based on Bardeen, Carter, and Hawking. 

5.7 Problems 

1. The metric of an extreme (Q = ±M) Reissner- Nordstrom black hole is given 
by 

d8 2 = -(l-M.) dt?+(l-—\ dr 2 +r 2 dSl 2 . 



r J \ r J 

a) Find an appropriate set of Kruskal coordinates for this spacetime. 

b) Show that the region r < M does not contain trapped surfaces. 

c) Sketch a Penrose-Carter diagram for this spacetime. 

d) Find a coordinate transformation that brings the metric to the form 

ds 2 = -(l + ^j \t 2 + (i + y) 2 {dx 2 +dy 2 + dz 2 ), 

where f 2 = x 2 +y 2 + z 2 . Show that in these coordinates, the electromag- 
netic field tensor can be generated from the vector potential A a dx a = 
=f(1 + M/f) _1 dt, in which the upper (lower) sign gives rise to a positive 
(negative) electric charge. 

e) Show that the metric 

ds 2 = -$- 2 dt 2 + $ 2 (dx 2 + dy 2 + dz 2 ) 

and the vector potential A a dx a = =f$ _1 dt produce an exact solution 
to the Einstein-Maxwell equations provided that $(x) satisfies Laplace's 
equation V 2 3> = 0. Here, V 2 is the usual Laplacian operator of three- 
dimensional flat space, and x = (x,y,z). This metric is known as the 
Majumdar-Papapetrou solution. Prove that if the spacetime is asymp- 
totically flat, then the total charge Q and the ADM mass M are re- 
lated by Q = ±M. Finally, find an expression for $(a;) that corre- 
sponds to a collection of N black holes situated at arbitrary positions x n 
(n=l,2,---,JV). 

A black hole is formed by the gravitational collapse of null dust. During the 
collapse, the metric is given by an ingoing Vaidya solution with mass function 
m(v) = v/16. Spacetime is assumed to be flat before the collapse (v < 0), and 
after the collapse (v > i>o), the metric is given by a Schwarzschild solution 
with mass m = m(vo) = vo/16- We want to study various properties of this 
spacetime. 

a) Show that in the interval < v < vo, outgoing light rays are described by 

the parametric equations 

r(A) = cAe" A , v(X) = 4c (1 + A) e _A , 

where c is a constant. Show that v = 4r also describes an outgoing light 
ray. Plot a few of these curves in the (v,r) plane, using both positive 
and negative values of c. Plot also the position of the apparent horizon. 

b) Find the parametric equations that describe the event horizon. 
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c) Prove that the curvature singularity at r = is naked, in the sense that it 
is visible to observers at large distances. Prove also that at the moment 
it is visible, the singularity is massless. [It is generally true that the 
central singularity of a spherical collapse must be massless if it is naked. 
This was established by Lake (1992).] 

3. In this problem we have a closer look at the instability of black-hole tunnels, 
a topic that was mentioned briefly in Sees. 5.2.3 and 5.3.9. We will see that 
the instability is caused by the pathological behaviour of the ingoing branch 
(v = oo) of the inner horizon (r = r_). For reasons that will become clear, we 
shall call this the Cauchy horizon of the black-hole spacetime. For simplicity, 
we shall restrict our attention to the Reissner-Nordstrom (RN) spacetime. 
[The physics of the Cauchy-horizon instability was this author's Ph.D. topic; 
see Poisson and Israel (1990). The book by Burko and Ori (1997) presents a 
rather complete review of this fascinating part of black-hole physics.] 

a) Consider an event P located anywhere in the future of the Cauchy horizon. 

Argue that the conditions at P are not uniquely determined by initial 
data placed on a spacelike hypersurface £ located outside the black hole. 
Then argue that the Cauchy horizon is the boundary of the region of 
spacetime for which the evolution of this data is unique. (This region 
is called the domain of dependence of S, and we say that the Cauchy 
problem of general relativity is well posed in this region. The Cauchy 
horizon is the place at which the evolution ceases to be uniquely deter- 
mined by the initial data; the Cauchy problem breaks down. In effect, 
the predictive power of the theory is lost at the Cauchy horizon.) 

b) Consider a test null fluid in RN spacetime, with stress-energy tensor 

rpa/3 _ pi^iP^ w here p is the energy density and l a = —d a v the four- 
velocity. The fluid moves parallel to the Cauchy horizon, along ingoing 
null geodesies. Prove that p must be of the form 



where L(v) is an arbitrary function of advanced time v. Show that if a 
finite quantity of energy is to enter the black hole, then L — > as v — > oo. 
(How fast must L vanish?) Typically, radiative fields outside black holes 
decay in time according to an inverse power law (Price 1972). We shall 
therefore take L(v) ~ v~ p as v — > oo, with p larger than, say, 2. 
c) Consider now a free-falling observer inside the RN black hole. This ob- 
server moves in the outward radial direction, encounters the null dust, 
and measures its energy density to be T a pu a u^, where u a is the ob- 
server's four velocity. Show that as the observer crosses the Cauchy 
horizon, 

where E = —u a t a and k_ was defined in Sec. 5.2.2. Conclude that the 
measured energy density diverges at the Cauchy horizon, even though the 
total amount of energy entering the hole is finite. This is the pathology 
of the Cauchy horizon, which ultimately is responsible for the instability 
of black- hole tunnels. 

4. The equations governing geodesic motion in the Kerr spacetime were given 
without justification in Sec. 5.3.6. Here we provide a derivation, which is 
valid both for timelike and null geodesies. [The general form of the geodesic 
equations can be found in Sec. 33.5 of Misner, Thorne, and Wheeler (1973).] 
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a) By definition, a Killing tensor field £ a p is one which satisfies the equa- 

tion £(a/3; 7 ) = 0. Show that if £ a p is a Killing tensor and u a satisfies 
the geodesic equation (u a puP = 0), then £ a i3U a uP is a constant of the 
motion. 

b) Verify that 

£ a p = A k {a l p) + r 2 g a p 

is a Killing tensor of the Kerr spacetime. Here, k a and l a are the null 
vectors defined in Sec. 5.3.6. 

b) Write the relations E = —t a u a and L = 4> a u a explicitly in terms of 

u a = (t,f,6,<f>). Then invert these relations to obtain the equations for 
t and 4>. [Hint: Make sure to involve the inverse metric] 

c) The Carter constant £1 is defined by 

i af) u a u^ =£+ (L-aE) 2 . 

By working out the left-hand side, derive the equation for r. [Hint: 
Express k a and l a in terms of the Killing vectors, and then £ a p u a u^ in 
terms of E, L, and f 2 .} 

d) Finally, use the normalization condition g a pu a u^ = —( (where ( = 1 for 

timelike geodesies and £ = for null geodesies) to obtain the equation 
for (9. 

5. Let l a be a null, geodesic vector field in flat spacetime. With this vector and 
an arbitrary scalar function H we construct a new metric tensor g a p: 

g a p = Va/3 + Hl a lp, 

where T) a p is the Minkowski metric and l a = t] a pl^ ■ Such a metric is called a 
Kerr-Schild metric. 

a) Show that l a is null with respect to both metrics. 

b) Show that g a ^ = n"^ — Hl a l^ is the inverse metric. 

c) Prove that l a = g a pl^ and l a = g a ^lp- Thus, indices on the null vector 

can be lowered and raised with either metric. 

d) Calculate the Christoffel symbols for g a p. Show that they satisfy the 

relations 

^ r "«/3 = _ 2 HtoJp, lliT %p = 2 
where H = H^l". 

e) Prove that l a .^P = 0. Thus, l a is a geodesic vector field in both metrics. 

f) Prove that the component R a pl a l^ of the Ricci tensor vanishes for any 

choice of function H . 

6. Complete the discussion of the zeroth law by proving that «, a is constant 
along the null generators of a stationary event horizon. 

7. In this problem we provide an alternative derivation of the zeroth law of 
black-hole mechanics. This derivation is based on the original presentation 
by Bardeen, Carter, and Hawking (1973); it uses the Einstein field equations 
and the dominant energy condition. 
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a) We have seen that the vector is tangent to the null generators of the 

event horizon. It satisfies the following properties: (i) £ a is null on the 
horizon; (ii) = k£ u on the horizon; (iii) £ a is a Killing vector; and 

(vi) the congruence of null generators has zero expansion, shear, and 
rotation. Use these facts to infer 

£a;/3 = (KN a +C A eAa)£l3 ~ €a {uNp + C B e B j}), 

where c A = o AB £ }a -pN a e l3 B . This relation holds on the horizon only. 

b) Prove that the gradient of the surface gravity (in the directions tangent 

to the horizon) is given by 

K, a = -R a ^sfN^ s - (a AB c A c B )Z a . 

This immediately implies that k is constant on each generator: K, a £ a = 
0. 

c) Show that the result of part b) also implies 

K , a e% = -R aP e a A ^ - o BC R aPlS e^e^. 

d) The quantities Bab = £« ; /3 e^e B and their tangential derivatives must all 

vanish on the horizon. Use this observation to derive 

Ra^e^el? =0. 

This relations holds on the horizon only. 

e) Collecting the results of parts c) and d), use the Einstein field equations 

to write 

where j a = —T a ^P represents a flux of momentum across the horizon. 

f) The dominant energy condition states that j a should be either timelike or 

null, and future directed. Use this, together with the stationary condition 
T a p£, a t,P = 0, to prove that j a must be parallel to £ a . Under these 
conditions, therefore, 

= 0, 

and the zeroth law is established. 

8. The unique solution to the Einstein-Maxwell equations describing an isolated 
black hole of mass M, angular momentum J = aM, and electric charge Q is 
known as the Kerr-Newman solution; it was discovered by Newman et al. in 
1965. The Kerr-Newman metric can be expressed as 

ds 2 = - ^ dt 2 + ^ sin 2 6(d4>-Ljdt) 2 + ?- dr 2 + p 2 d6 2 , 

where p 2 = r 2 +a 2 cos 2 6, A = r 2 -2Mr + a 2 +Q 2 , S = (r 2 +a 2 ) 2 -a 2 A sin 2 6, 
and w = a{r 2 + a 2 — A)/S. The metric comes with a vector potential 

A a dx a = (dt -a sin 2 6d4>). 

When Q = 0, A a = and this reduces to the Kerr solution. 

a) Find expressions for r+, the radius of the event horizon, and fin, the 
angular velocity of the black hole. 
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b) Prove that the vector field 



V 2 -\- a 2 CL 

l a d a = — - — d t - d r + — dtj, 
is tangent to a congruence of ingoing null geodesies. Prove also that 

,.2 I „2 



and 



/I 



dr 



tp = <p + 



are constant on each member of the congruence, 
c) Show that in the coordinates (v,r,0,ip), the Kerr-Newman metric takes 



the form 

J„2 _ 



A - a 2 sin 2 6 , , „ , , „ r 2 + a 2 - A . 2 „ , , , 
5 dv + 2 dvdr — 2a 5 sin dvdtp 



- 2a sin 2 6 drdtp + sin 2 6 dtp 2 + p 2 d6 2 . 
p 2 

Find an expression for A a is this coordinate system, 
d) Show that the vectors 



and 



Z a d a = d v + n H d i ,, 

a 2 sin 2 6 



eg d a = d 6 



N a d n = 



r + 2 + a 2 



2(r + 2 + a 2 cos 2 6) v r + 2 +a 2 cos 2 6 
a 2r + 2 + a 2 (l + cos 2 6i) 



2(r + 2 + a 2 ) r + 2 + a 2 cos 2 



form a good basis on the event horizon. In particular, prove that they 
give rise to the completeness relations g a ^ = —^ a N^ — N a ^+a AB e^e^, 
where a AB is the inverse of gab = g a /3 e A e B- 

e) Prove that the surface gravity of a Kerr-Newman black hole is given by 

r + - M 
r + A + gt 

Prove also that the hole's surface area is 

A = 4n(r + 2 +a 2 ). 

f) Compute the black-hole mass M H and the black-hole angular momen- 

tum Jh of a Kerr-Newman black hole. (These quantities are defined in 
Sec. 5.5.3.) Make sure that your results are compatible with the following 
expressions: 



M H = 



r + 2 + a 2 
2r+ 



Q2 

1 arctan(a/r + ) 

ar + v ' 



and 



J H = a 



r + 2 + a 2 
2r+ 



1 + 



91 

2a 2 



t_i_ 2 + a 2 

1 arctan(o/r_i_) 

ar + y ' 



Verify that these expressions satisfy the generalized Smarr formula. 
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g) Derive the following alternative version of Smarr's formula: 

kA 

M = 2U H J + — + $ H Q, 
4n 

where 



,^r+ r + 2 + a 2 

is the electrostatic potential at the horizon. 

h) Consider a quasi-static process during which a stationary black hole of 
mass M, angular momentum J, and electric charge Q is taken to a new 
stationary black hole with parameters M + 6M, J + 6 J, and Q + 6Q. 
Prove that during such a transformation, the hole's surface area A will 
change by an amount 6 A given by 

SM = ^- 6A + tt H 6 J + § H 6Q. 

This is the first law of black-hole mechanics for charged, rotating black 
holes. 

9. Consider a quasi-static process during which the surface area of a black hole 
changes. (By quasi-static we mean that dA/dv is very small.) Derive the 
Hawking-Hartle formula, 



in which = dx a /dv is tangent to the null generators of the event horizon, 
and a a j3 is their shear tensor. The second term within the integral represents 
the effect of accreting matter on the surface area. The first term represents 
the effect of gravitational radiation flowing across the horizon. 
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